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FOOR-COLOR DMA SEQOENCING B Y 
FLODRBSCENT NOCLEOTIPg 



The invention 



USING CliBAV ABLE 
TERMIHilTORS 



disclosed herein was made with 
government support NIH grant P50-HG002806 . Accordingly, 
the U.S. Government has certain rights 



m this 



invention. 



Throughout this application, various publications are 
referenced in parentheses by number. Full citations 
for these references may be found at the end of the 
specification immediately preceding the claims. The 
disclosures of these publications in their entireties 
are hereby incorporated by reference into this 
application to more fully describe the state of the 
art to which this invention pertains. 

Background of tha lnir»>»«-8^« 



25 



30 



DNA sequencing is driving genomics research and 
discovery. The completion of the Human Genome Project 
has set the stage for screening genetic mutations to 
identify disease genes on a genome-wide scale (1) . 
Accurate high- throughput DNA sequencing methods are 
needed to explore the complete human genome sequence 
for applications in clinical medicine and health care. 
To overcome the limitations of the current 
electrophoresis-based sequencing technology (2-5), a 
variety of new DNA-sequencing methods have been 
investigated. Such approaches include sequencing by 
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hybridization (6) , mass spectrometry based sequencing 
(7-9), sequence-specific detection of single -stranded 
DNA using engineered nanopores (10) and sequencing by 
ligation (11). More recently, DNA sequencing by 
5 synthesis (SBS) approaches such as pyrosequencing 
(12) , sequencing of single DNA molecules (13) and 
polymerase colonies (14) have been widely explored. 

The concept of DNA sequencing by synthesis (SBS) was 
10 revealed in 1988 with an attempt to sequence DNA by 
detecting the pyrophosphate group that is generated 
when a nucleotide is incorporated in a DNA polymerase 
reaction (15) . Pyrosequencing which was developed 
based on this concept and an enzymatic cascade has 
15 been explored for genome sequencing (16). However, 
there are inherent difficulties in this method for 
determining the number of incorporated nucleotides in 
homopolymeric regions of the template. Additionally, 
each of the four nucleotides needs to be added and 
20 detected separately, which increases the overall 
detection time. The accumulation of un-degraded 
nucleotides and other components could also lower the 
accuracy of the method when sequencing a long DNA 
template. It is thus desirable to have a simple method 
25 to directly detect a reporter group attached to the 
nucleotide that is incorporated into a growing DNA 
strand in the polymerase reaction rather than relying 
on a complex enzymatic cascade. The SBS scheme based 
on fluorescence detection coupled with a chip format 
0 has the potential to markedly increase the throughput 
of DNA sequencing projects. Consequently, several 
groups have investigated such a system with an aim to 
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construct an ultra high- throughput DNA sequencing 
method (17-18>: Thus far, no complete success of using 
such a system to unambiguously sequence DNA has been 
published. 

5 

Previous work in the literature exploring the SBS 
method is mostly focused on designing and synthesizing 
a cleavable chemical moiety that is linked to a 
fluorescent dye to cap the 3' -OH group of the 
10 nucleotides (19-21). The rationale is that after the 
fluorophore is removed, the 3' -OH would be regenerated 
to allow subsequent nucleotide addition. However, no 
success has been reported for the incorporation of 
such a nucleotide with a cleavable fluorescent dye on 
15 the 3' position by DNA polymerase into a growing DNA 
strand. The reason is that the 3' position on the 
deoxyribose is very close to the amino acid residues 
in the active site of the polymerase, and the 
polymerase is therefore sensitive to modification in 
20 this area of the ribose ring, especially with a large 
fluorophore (22) • 
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Suamarv of the Invention 

This invention provides a method for determining the 
5 sequence of a DNA comprising performing the following 
steps for each residue of the DNA to be sequenced: 

(a) contacting the DNA with a DNA polymerase 
in the presence of (i) a primer and (ii) 
four nucleotide analogues under conditions 
permitting the DNA polymerase to catalyze 
DNA synthesis, wherein (i) the nucleotide 
analogues consist of an analogue of dGTP, 
an analogue of dCTP, an analogue of dTTP 
or dUTP, and an analogue of dATP, (2) each 
nucleotide analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
""cil, and analogues thereof, (ii) a 
deoxyr ibose, (iii) a moiety cleavably 
linked to the 3' -oxygen of the deoxyr ibose 
and (iv) a unique label cleavably linked 
to the base, so that a nucleotide analogue 
complementary to the residue being 
sequenced is incorporated into the DNA by 
the DNA polymerase, and (3) each of the 
four analogues has a unique label which is 
different than the unique labels of the 



(b> 

30 (c) 



removing unbound nucleotide analogues; 
again contacting the DNA with a DNA 
polymerase in the presence of (i) a primer 
and (ii) four reversible terminators under 
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(d) 

15 (e) 



20 



25 



(f) 




conditions permitting the DNA polymerase 
to catalyze DNA synthesis, wherein (1) the 
reversible terminators consist of an 
analogue of dGTP, an suialogue of dCTP, an 
analogue of dTTP or dUTP, and an analogue 
of dATP, (2) each nucleotide analogue 
comprises (i) a base selected from the 
group consisting of adenine, guanine, 
cytosine, thymine or uracil, and analogues 
thereof, which base does not have a unique 
label bound thereto, (ii) a deoxyribose. 
and (iii) a moiety cleavably linked to the 
3 '-oxygen of the deoxyribose; 
removing unbound reversible terminators; 

the identity of the nucleotide 
analogue incorporated in step (a) via 
determining the identity of the 
corresponding unique label. with the 
proviso that step (e) can either precede 
step (c) or follow step (d) ; and 
following step (e) , except with respect to 
the final DNA residue to be sequenced, 
cleaving from the incorporated nucleotide 
analogues the unique label, if applicable, 
and the moiety linked to the 3' -oxygen 
atom of the deoxyribose, 

the seqpience of the DNA. 





This invention also provides a kit for performing the 
30 method of claim i. comprising, in 
compartments , 
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) nucleotide analogues of (i) GTP, (ii) ATP, 
(iii) CTP and (iv) TTP or UTP, wherein 
each analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
uracil, or an analogue thereof, (ii) a 
deoxyribose, (iii) a cleavable moiety 
bound to the 3' -oxygen of the deoxyribose 
and (iv) a unique label bound to the base 
via a cleavable linker, 

reversible terminators comprising a 
nucleotide analogue of (i) GTP, (ii) atp, 
(iii) CTP and (iv) TTP or UTP, wherein 
each analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
uracil, or an euialogue thereof, which base 
does not have a unique label bound thereto, 
(ii) a deoxyribose, and (iii) a cleavable 
moiety bound to the 3' -oxygen of the 
deoxyribose ; 

reagents suitable for use in DNA 
polymerization; and 
instructions for use. 
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Brief Daacrlptlon of tha Fl( 

Figure 1. a chip is constructed with immobilized DNA 
5 templates that are able to self -prime for initiating 
the polymerase reaction. Four nucleotide analogues are 
designed such that each is labeled with a unique 
fluorescent dye on the specific location of the base 
through a cleavable linker, and a small chemically 
I reversible moiety (R) to cap the 3 ■ -OH group, upon 
adding the four nucleotide analogues and DNA 
polymerase, only the nucleotide analogue complementary 
to the next nucleotide on the template is incorporated 
by polymerase on each spot of the chip (step 1) . A 4 
color fluorescence imager is used to image the surface 
of the chip, and the unique fluorescence emission from 
the specific dye on the nucleotide analogues on each 
spot of the chip will yield the identity of the 
nucleotide (step 2). After imaging, the small amount 
of unreacted 3 • -OH group on the self -primed template 
moiety is capped by excess ddNTPs and DNA polymerase 
to avoid interference with the next round of synthesis 
or by 3'-0-allyl-dNTPs to synchronize the 
incorporation (step 3). The dye moiety and the R 
protecting group will be removed to generate a free 
3 '-OH group with high yield (step 4). The self -primed 
DNA moiety on the chip at this stage is ready for the 
next cycle of the reaction to identify the next 
nucleotide sequence of the template DNA (step 5) . 

Figure 2. Structures of 3 ' -o-allyl-dCTP-allyl-Bodipy- 
FL-510 (X,^. = 502 nm; . 510 nm) , 3'-0- 
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allyl-dUTP-allyl-R6G (Xab. (max) = 525 nm; Xe« (max) = 550 
nm) , 3' -O-allyl-dATP-allyl-ROX (Xabs (maxi = 585 nm; Xen 
(maxt = 602 nm) , and 3' -O-allyl-dGTP-aHyl-Bodipy-650 

(X.ab8 (max) • 630 nm; (max) 

« 650 nm) . 

5 

Figure 3. The polymerase extension scheme (left) and 
MALDI-TOF MS spectra of the four consecutive extension 
products and their deallylated products (right) . 
Primer extended with 3 ' -0-allyl-dUTP-allyl-R6G (1) , 
10 and its deallylated product 2; Product 2 extended with 
3' -O-allyl-dGTP-allyl-Bodipy-650 (3), and its 

deallylated product 4; Product 4 extended with 3'-0- 

ft 

allyl-dATP-allyl-ROX (5), and its deallylated product 
6; Product 6 extended with 3' -O-allyl-dCTP-allyl- 
15 Bodipy-FL-510 (7), and its deallylated product 8. 
After 3 0 seconds of incubation with the 
palladium/TPPTS cocktail at 70*'C, deallylation is 
complete with both the fluorophores and the 3'-0-allyl 
groups cleaved from the extended DNA products. 

20 

Figure 4, DNA extension reaction performed in solution 
phase to characterize the four different chemically 
cleavcODle fluorescent nucleotide analogues (3'-0- 
allyl-dUTP-allyl-R6G, 3 ' -O-allyl-dGTP-allyl-Bodipy-650, 

25 3' -O-allyl-dATP-allyl-ROX and 3' -O-allyl-dCTP-allyl- 
Bodipy-FL-510) . After each extension reaction, the DNA 
extension product is purified by HPLC for MALDI-TOF MS 
measurement to verify that it is the correct extension 
product. Pd-catalyzed deallylation reaction is 

30 performed to produce a DNA product that is used as a 
primer for the next DNA extension reaction. 
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Figure 5. Preparation of azide-functionalized glass 
chip through a PEG linker for the immobilization of 
alkyne labeled self -priming DNA template for SBS. 

Figure 6. (A) Reaction scheme of SBS on a chip using 
four chemically cleavable fluorescent nucleotides, (b) 
The scanned 4-color fluorescence images for each step 
of SBS on a chip: (i) incorporation of 3'-0-allyl- 
dGTP-allyl-CyS; (2) cleavage of allyl-CyS and 3'-allyl 
group; (3) incorporation of 3 ' -0-allyl-dATP-allyl-R0X; 
(4) cleavage of allyl-Rox and 3'-allyl group; (5) 
incorporation of 3 ' -0-allyl-dUTP-allyl-R6G; (6) 
cleavage of allyl-RSG and 3'-aHyl group; (7) 
15 incorporation of 3 ' -O-allyl-dCTP-allyl-Bodipy-FL-510; 
(8) cleavage of allyl-Bodipy-FL-5lO and 3'-allyl group; 
images (9) to (25) are similarly produced. (C) A plot' 
(4-color sequencing data) of raw fluorescence emission 
intensity at the four designated emission wavelength 
20 of the four chemically cleavable fluorescent 



Figure 7. Structures of 3 ' -O-allyl-dATP, 3'-0-allyl- 
dCTP, 3' -O-allyl-dGTP, and 3 ' -O-allyl-dTTP. 

25 

Figure 8. (A) 4-color DNA sequencing raw data with our 
sequencing by synthesis chemistry using a template 
containing two homopolymeric regions. The individual 
base (A. T, c, G) , the 10 repeated A' s and the 5 
30 repeated A's are clearly identified. The small groups 
of peaks between the identified bases are fluorescent 
background from the DNA chip, which does not build up 
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as the cycle continues. (B) The pyrosequencing data of 
the same DNA template containing the homopolymeric 
regions (lO T's and s T's). The first 4 individual 
bases are clearly identified. The two homopolymeric 
5 regions (10 A's) and (5 A's) produce two large peaks, 
making it very difficult to determine the exact 
sequence from the data. 

Figure 9. Single base extension reaction and MALDI-TOP 
10 MS of 3'-0-Allyl-dUTP-allyl-R6G. 

Figure 10. Single base extension reaction and MALDI- 
TOP MS of 3'-0-Allyl-dATP-allyl-R0X (39). 



15 



Figure 11. Single base extension reaction and MALDi- 
TOP MS of 3'-0-Allyl-dGTP-allyl-Bodipy-650 (43) and 
3' -O-allyl-dGTP-allyl-CyS (44). 



20 
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^ Pebailed Description of the Invention 
5 Terms 

As used herein, and unless stated otherwise, each of 
the following terms shall have the definition set 
forth below. 

Adenine ; 
Cytosine; 

Deoxyribonucleic acid; 
Guanine ; 

Ribonucleic acid; 
Thymine ; and 
Uracil. 

"Nucleic acid" shall mean any nucleic acid molecule, 
20 including, without limitation, DNA, RNA and hybrids 
thereof. The nucleic acid bases that form nucleic acid 
molecules can be the bases A, C, G, T and U, as well 
as derivatives thereof. Derivatives of these bases are 
well known in the art, and are exemplified in PCR 
25 Systems, Reagents and Consumables (Perkin Elmer 
Catalogue 1996-1997, Roche Molecular Systems, Inc., 
Branchburg, New Jersey, USA) . 

"Type" of nucleotide refers to A, G, C, T or U. 



A 

C 

DNA 



15 RNA 
T 
U 
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"Mass tag" shall mean a molecular entity of a 
predetermined size which is capable of being attached 
by a cleavable bond to another entity. 



20 



"Solid substrate" shall mean any suitable medium 
present in the solid phase to which an antibody or an 
agent may be a; 



Where a range of values is provided, it is understood 
that each intervening value, to the tenth of the unit 
of the lower limit unless the context clearly dictates 
otherwise, between the upper and lower limit of that 
range, and any other stated or intervening value in 
that stated range, is encompassed within the invention. 
The upper and lower limits of these smaller ranges may 
independently be included in the smaller ranges, and 
are also encompassed within the invention, subject to 
any specifically excluded limit in the stated range. 
Where the stated range includes one or both of the 
limits, ranges excluding either or both of those 
included limits are also included in the invention. 
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Embodiments of the Invention 



5 This invention provides a method for determining the 
sequence of a DNA comprising performing the following 
steps for each residue of the DNA to be secpjienced: 

(a) contacting the DNA with a DNA polymerase 
in the presence of (i) a primer and (ii) 
four nucleotide analogues under conditions 
permitting the DNA polymerase to catalyze 
DNA synthesis, wherein (1) the nucleotide 
analogues consist of an analogue of dGTP, 
an analogue of dCTP, an euialogue of dTTP 
15 or dUTP, and an analogue of dATP, (2) each 

nucleotide analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
uracil, and analogues thereof, (ii) a 
20 deoxyribose, (iii) a moiety cleavably 

linked to the 3' -oxygen of the deoxyribose 
and (iv) a unique label cleavably linked 
to the base, so that a nucleotide analogue 
complementary to the residue being 
25 sequenced is incorporated into the DNA by 

the DNA polymerase, and (3) each of the 
four analogues has a unique label which is 
different than the unique labels of the 
other three analogues ; 
30 (b) removing unbound nucleotide analogues; 

(c) again contacting the DNA with a DNA 
polymerase in the presence of (i) a primer 
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and (ii) four reversible terminators under 
conditions permitting the DNA polymerase 
to catalyze DNA synthesis, wherein (1) the 
reversible terminators consist of an 
5 analogue of dGTP, an analogue of dCTP, an 

analogue of dTTP or dUTP, and an analogue 
of dATP, (2) each nucleotide analogue 
comprises (i) a base selected from the 
group consisting of adenine, guanine, 
cytosine, thymine or uracil, and analogues 
thereof, which base does not have a unique 
label bound thereto, (ii) a deoxyribose, 
and (iii) a moiety cleavably linked to the 
3 '-oxygen of the deoxyribose; 
(ci) removing unbound reversible terminators; 
(e) determining the identity of the nucleotide 
analogue incorporated in step (a) via 
determining the identity of the 
corresponding unique label, with the 
proviso that step (e) can either precede 
step (c) or follow step (d) ; and 
following step (e) , except with respect to 
the final DNA residue to be sequenced, 
cleaving from the incorporated nucleotide 
analogues the unique label, if applicable, 
and the moiety linked to the 3' -oxygen 
atom of the deoxyribose, 
thereby determining the sequence of the DNA, 

3 0 This invention also provides the instant method, 
wherein step (e) is performed before step (c) . 



(f) 
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This invention also provides the instant method, 
wherein the moiety cleavably linked to the 3 '-oxygen 
of the deoxyribose is chemically cleavable or 
photocleavable. This invention also provides the 
instant method, wherein the moiety cleavably linked to 
the 3 '-oxygen of the deoxyribose in the nucleotide 
analogs of step (a) is an allyl moiety or a 2- 
nitrobenzyl moiety. 

This invention also provides the instant method, 
wherein the moiety cleavably linked to the 3' -oxygen 
of the deoxyribose in the reversible terminators of 
step (c) is an allyl moiety or a 2-nitroben2yl moiety. 

This invention also provides the instant method, 
wherein the unique label is bound to the base via a 
chemically cleavable or photocleavable linker. 

This invention also provides the instant method, 
wherein the unique label bound to the base via a 
cleavable linker is a dye, a fluorophore. a 
chromophore, a combinatorial fluorescence 
ir tag, a mass tag, or an electroph 



25 This invention also provides the instant method, 
wherein the moiety is chemically cleavable with 
Na,PdCl4/P(PhS03Na)3. This invention 'also provides the 
instant method, wherein the linker is chemically 
cleavable with NaaPdCl*/? (PhSOaNa) 3 . 

30 

This invention also provides the instant method, 
wherein the primer is a self -priming moiety. 
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This invention also provides the instant method, 
wherein the DNA is bound to a solid substrate. This 
invention also provides the instant method, wherein 
the DNA is bound to the solid substrate via 1.3- 
dipolar azide-alkyne cycloaddition chemistry. This 
invention also provides the instant method, wherein 
the DNA is bound to the solid substrate via a 
polyethylene glycol molecule.. This invention also 
provides the instant method, wherein the DNA is 
alkyne- labeled. This invention also provides the 
instant method, wherein the DNA is bound to the solid 
substrate via a polyethylene glycol molecule and the 
solid substrate is azide-functionalized. This 
invention also provides the instant method, wherein 
the DNA is immobilized on the solid substrate via an 
azido linkage. an alkynyl linkage, or biotin- 




ion. 



20 This invention also provides the instant method 
wherein the solid substrate is in the form of a chip, 
a bead, a well, a capillary tube, a slide, a wafer, a 
filter, a fiber, a porous media, or a column. This 
invention also provides the instant method, wherein 

J5 the solid substrate is gold, quartz, silica, plastic 
glass, diamond, silver, metal, or polypropylene. This 
invention also provides the instant method, wherein 
the solid substrate is porous. 



30 



This invention also provides the instant method 
wherein about lOOO or fewer copies of the DNA are 
bound to the solid substrate. This invention also 
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provides the instant invention wherein Ixio', 1x10* or 
1x10* or fewer copies of the DNA are bound to the solid 



5 This invention also provides the instant method, 
wherein the four nucleotide analogues in step (a) are 
3' -O-allyl-dGTP-allyl-CyS, 3 ' -O-allyl-dCTP-allyl- 

Bodipy-FL-510, 3 ' -O-allyl-dATP- allyl-ROX and 3'-0- 
allyl-dUTP-allyl-R6G..This invention also provides the 
10 instant method, wherein the four nucleotide analogues 
in step (a) are 3' -O-allyl-dGTP-allyl-Bodipy-FL-SlO, 
3' -O-allyl-dCTP-allyl-Bodipy-650, 3 ' -O-allyl-dATP- 

allyl-ROX and 3 ' -0-allyl-dUTP-allyl-R6Q. This 
invention also provides the instant method, wherein 
15 the four nucleotide analogues in step (a) are 3'-o- 
allyl-dGTP-allyl-Bodipy-6S0. 3 ' -O-allyl-dCTP-allyl- 

Bodipy-FL-siO, 3'-0-allyl-dATP-allyl-R0X and 3'-o- 
al ly 1 - dUTP - al ly 1 - R6G . 

20 It is understood that in other embodiments the 
nucleotide analogues are photocleavable . For example, 
2.nitrobenzyl can replace any of the allyl moieties in 
the analogues described herein. For example, 3'-0-2- 
nitrobenzyl-dGTP-allyl-Bodipy-SSO, 3 ' -0-2-nitroben2yl- 
25 dGTP-2-nitroben2yl-Bodipy-650, 3 ' -O-allly-dGTP-2- 
nitrobenzyl-Bodipy-650. One of skill in the art would 
recognize various other chemically cleavable or 
photochemically cleavable moieties or linkers that can 
be used in place of the examples described herein 
>0 Additionally, the unique labels may also be varied, 
and the examples set forth herein are non-limiting, in 
an embodiment uv light is used to photochemically 
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cleave the photochemical ly cleavable linkers and 
moieties. 

This invention also provides the instant method, 
5 wherein the reversible terminators in step (c) are 3'- 
0-allyl-dGTP, 3' -O-allyl-dCTP, 3 ' -O-allyl-dATP and 3'- 
0-allyl-dUTP, This invention also provides the instant 
method, wherein the reversible terminators in step (c) 
are 3' -O-2-nitrobenzyl-dGTP, 3' -O-2-nitrobenzyl-dCTP, 
10 3' -0-2-nitrobenzyl-dATP and 3' -0-2-nitroben2yl-dUTP. 
In an embodiment the reversible teirminator is 
incorporated into the growing strand of DNA. 

This invention also provides the instant method, 
15 wherein the DNA polymerase is a 9**N polymerase or a 
variant thereof. DNA polymerases which can be used in 
the instant invention include, for example E.Coli DNA 
polymerase I, Bacteriophage T4 DNA polymerase, 
Sequenase™, Taq DNA polymerase and S^N polymerase 
20 (exo-) A485L/Y409V.RNA polymerases which can be used 
in the instant invention include, for example. 
Bacteriophage SP6, T7 and T3 RNA polymerases. 

This invention also provides the instant method, 
25 wherein the DNA is bound to the solid substrate via a 
polyethylene glycol molecule and the solid substrate 
is azide-functionalized or the DNA is immobilized on 
the solid substrate via an azido linkage, an alkynyl 
linkage, or biotin-streptavidin interaction; wherein 
30 (i) the four nucleotide analogues in step (a) are 3'- 
O-allyl-dGTP-allyl-CyS, 3 ' -O-allyl-dCTP-allyl-Bodipy- 
FL-510, 3' -O-allyl-dATP- allyl-ROX and 3' -O-allyl- 
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dUTP-allyl-R6G, (ii) the four nucleotide analogues in 
step (a) are 3' -O-allyl-dGTP-allyl-Bodipy-FL-510, 3'-. 
O-allyl-dCTP-allyl-Bodipy-650, 3' -O-allyl-dATP-allyl- 
ROX and 3 ' -0-allyl-dUTP-allyl-R6G, or (iii) the four 
5 nucleotide analogues in step (a) are 3' -O-allyl-dGTP- 
allyl-Bodipy-650, 3' -O-allyl-dCTP-allyl-Bodipy-FL-510, 
3' -O-allyl-dATP-allyl-ROX and 3' -O-allyl-dUTP-allyl- 
R6G; and wherein the reversible terminators in step (c) 
are 3 ' -0-allyl-dGTP, 3 ' -O-allyl-dCTP, 3' -O-allyl-dATP 
10 and 3' -O-allyl-dUTP. 

This invention also provides a kit for performing the 
instant method comprising, in separate compartments, 

(a) nucleotide analogues of (i) GTP, (ii) ATP, 
^5 (iii) CTP and (iv) TTP or UTP, wherein 

each analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
uracil, or an analogue thereof, (ii) a 
10 deoxyribose, (iii) a cleavable moiety 

bound to the 3' -oxygen of the deoxyribose 
and (iv) a unique label boiand to the base 
via a cleavable linker, 

(b) reversible terminators comprising a 
5 nucleotide analogue of (i) GTP, (ii) ATP, 

(iii) CTP and (iv) TTP or UTP, wherein 
each analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
0 uracil, or an analogue thereof, which base 

does not have a unique label bound thereto, 
(ii) a deoxyribose, and (iii) a cleavable 
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moiety botmd to the 3 ' -oxygen of the 




10 



(c) reagents suitable for use in DNA 
polymerization; and 

(d) instructions for use. 

This invention further provides the instant kit, 
wherein the nucleotide analogues of part (a) are 3'-o- 
allyl-dGTP-allyl-CyS, 3 ' -O-allyl-dCTP-allyl-Bodipy-PL- 
510, 3'.0-allyl-dATP- allyl-ROX and 3 ' -O-allyl-dUTP- 
allyl-R6G. This invention further provides the instant 
kit, wherein the nucleotide analogues of part (a) are 
3' -O-allyl-dGTP-allyl-Bodipy-FL-SlO, 3 ' -O-allyl-dCTP- 
allyl-Bodipy-650 , 3'-0-allyl-dATP-allyl-R0X and 3 ' -O- 
15 allyl-dUTP-allyl-R6G. This invention further provides 
the instant kit, wherein the nucleotide analogues of 
part (a) are 3 ' -O-allyl-dGTP-allyl-Bodipy-650 , 3'-o- 
allyl-dCTP-allyl-Bodipy-FL-siO, 3 ' -O-allyl-dATP-allyl- 
ROX and 3'-0-allyl-dUTP-allyl-R6G. This invention 
20 further provides the instant kit, wherein the 
nucleotide analogues of part (b) are 3 ' -O-allyl-dGTP, 
3'-0-allyl-dCTP, 3'-0-allyl-dATP and 3' -O-allyl-dUTp! 
This invention further provides the instant kit, the 
reversible terminators in step (c) are 3'-o-2- 
25 nitrobenzyl-dGTP, 3 • -O-2-nitrobenzyl-dCTP, 3' -0-2- 
nitrobenzyl-dATP and 3 ' -o-2-nitrobenzyl-dUTP. 

The methods and kits of this invention may be applied 
mutatis mutandis, to the sequencing of rna, or to 
30 determining the identity of a ribonucleotide. 
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Methods for production of cleavably capped and/or 
cleavably linked nucleotide analogues are disclosed in 
U.S. Patent No. 6,664,079, which is hereby 
incorporated by reference. Combinatorial 
energy tags and methods for production thereof 
disclosed in U.S. Patent No. 6,627.748, which is 
hereby incorporated by reference. 

In an embodiment, the DNA or nucleic acid is attached 
/bound to the solid surface by covalent site-specific 
coupling chemistry conpatible with DNA. 

This invention will be better understood by reference 
to the Experimental Details which follow, but those 
skilled in the art will readily appreciate that the 
specific experiments detailed are only illustrative of 
the invention as described more fully in the claims 
which follow thereafter. 
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Experimental Detalla 

DNA sequencing by synthesis (SBS) on a solid surface 
during polymerase reaction offers a new paradigm to 
5 decipher DNA sequences. Disclosed here is the 
construction of such a novel DNA sequencing system 
using molecular engineering approaches. In this 
approach, four nucleotides (A, c. G, T) are modified 
as reversible terminators by attaching a cleavable 
10 fluorophore to the base and capping the 3' -OH group 
with a small chemically reversible moiety so that they 
are still recognized by DNA polymerase as substrates. 
It is found that an allyl moiety can be used 
successfully as a linker to tether a fluorophore to 
15 3'-0-allyl-„,odified nucleotides, forming chemically 
cleavable fluorescent nucleotide reversible 

terminators , 3 ' -o-allyl -dNTPa-e^llyl -fluorophore, for 
application in SBS. The fluorophore and the 3'-0-allyl 
group on a DNA extension product, which is generated 
10 by incorporating 3' -o.^iiyi.^ps-e^llyl. fluorophore in 
a polymerase reaction, are removed simultaneously in 
seconds by Pd-catalyzed deallylation in aqueous 
solution. This one-step dual -deallylation 
reaction thus allows the re-initiation of the 
polymerase reaction and increases the SBS efficiency 
DNA templates consisting of homopolymer regions were 
accurately sequenced by using this new class of 
fluorescent nucleotide analogues on a DNA chip and a 
4 -color fluorescent scanner. 



30 



It is known that some modified DNA polymerases are 
highly tolerable for nucleotides with extensive 
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modifications with bulky groups such as energy 
transfer dyes at the 5-position of the pyrimidines (T 
and C) and 7-position of purines (G and A) (23, 24). 
The ternary complexes of a rat DNA polymerase, a DNA 
5 template-primer, and dideoxycytidine triphosphate have 
been determined (22) which supports this fact. It was 
hypothesizrf that if a unique fluorescent dye is 
linked to the S-position of the pyrimidines (T and C) 
and the 7-poaition of purines (G and A) via a 
10 cleavable linker, and a small chemical moiety is used 
to cap the 3' -OH group, then the resulting nucleotide 
analogues may be able to incorporate into the growing 
DNA strand as terminators. Based on this rationale, 
SBS approach was conceived using cleavable fluorescent 
15 nucleotide analogues as reversible terminators to 
sequence surf ace -immobilized DNA in 2000 (Fig. i) (25) 
in this approach, the nucleotides are modified at two 
specific locations so that they are still recognized 
by DNA polymerase as substrates: (i) a different 
fluorophpre with a distinct fluorescent emission is 
linked to each of the 4 bases through a cleavable 
linker and (ii) the S-GH group is capped by a small 
chemically reversible moiety. DNA 

incorporates only a single nucleotide 

complementary to the base on a DNA template covalent^y 
linked to a surface. After incorporation, the unique 
fluorescence emission is detected to identify the 
incorporated nucleotide and the fluorophore is 
subsequently removed. The 3 • -OH group is then 
chemically regenerated, which allows the next cycle of 
the polymerase reaction to proceed, since the large 
surface on a DNA chip can have a high density of 



20 
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each cycle can 
allowing the 
number of dna 



different DNA templates 
identify many bases in 
simultaneous sequencing of a 
molecules. Thee feasibility of performing SBS on a 
chip using 4 photocleavable fluorescent nucleotide 
analogues was previously established 
discovered that an allyl group 
cleavable linker to bridge a 
nucleotide (27). The design and synthesis of two 
photocleavable fluorescent nucleotides as reversible 
terminators for polymerase reaction has already been 
reported (28, 29) . 



(26) and it was 
be used as a 
fluorophore to a 



15 



20 



25 



Previous research efforts in the present laboratory 
have firmly established the molecular level strategy 
to rationally modify the nucleotides by attaching a 
cleavable fluorescent dye to the base and capping the 
3' -OH with a small chemically reversible moiety for 
SBS. This approach was recently adopted by Genomics 
Industry to potentially provide a new platform for DNA 
sequencing (30). Here the design and synthesis of 4 
chemically cleavable fluorescent nucleotide analogues 
as reversible terminators for SBS is disclosed. Each 
of the nucleotide analogues contains a 3'-0-allyl 
group and a unique fluorophore with a distinct 
fluorescence emission at the base through a cleavable 
allyl linker. 



It was first established that these nucleotide 
30 analogues are good substrates for DNA polymerase in a 
solution-phase DNA extension reaction and that the 
fluorophore and the 3' -O-allyl group can be removed 
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with high efficiency in aqueous solution. Then SBS was 
performed using these 4 chemically cleavable 
fluorescent nucleotide analogues as reversible 
terminators to identify -20 continuous bases of a dna 
template immobilized on a chip. Accurate DNA sequences 
were obtained for DNA templates containing homopolymer 
sequences. The DNA template was immobilized on the 
surface of the chip that contains a PEG linker with 
1,3 -dipolar azide-alkyne cycloaddition chemistry. 
These results indicated that successful cleavable 
fluorescent nucleotide reversible terminators for 4- 
color DNA sequencing by synthesis can be designed by 
attaching a cleavable fluorophore to the base and 
capping the 3' -OH with a small chemically reversible 
moiety so that they are still recognized by DNA 
polymerase as substrates. Further optimization of the 

will lead to even longer 




20 Design and synthesis of < 

nucleotide analogues 



TO demonstrate the feasibility of carrying out de novo 
25 DNA sequencing by synthesis on a chip, four chemically 
cleavable fluorescent nucleotide analogues (3'-o- 
aJiyl-dCTP-allyl-Bodipy-FL-SlO, 3 ' -O-aJIyl-dUTP-ailyl- 
R6G, 3'-0-aJJy2-dATP.a2IyJ-ROX and 3 ' -O-aJJyl-dGTP- 
alIyi-Bodipy.650/CyS) (Pig. 2) were designed and 
30 synthesized as reversible terminators for DNA 
polymerase reaction. Modified DNA polymerases have 
been shown to be highly tolerant to nucleotide 
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modifications with bulky groups at the 5-position of 
pyrimidines (c and U) and the 7-position of purinfes (a 
and G) . Thus, each unique fluorophore was attached to 
the 5 position of C/U and the 7 position of a/g 
through an ally! carbamate linker. However, due to the 
close proximity of the 3' position on the sugar ring 
of a nucleotide to the amino acid residues of the 
active site of the DNA polymerase, a relatively small 
allyl moiety was chosen as the . 3 ' -OH reversible 
capping group, it was found that the fluorophore and 
the 3'-0-allyl group on a DNA extension product, which 
is generated by incorporation of the chemically 
cleavable fluorescent nucleotide analogues. are 
removed simultaneously in 30 seconds by Pd-catalyzed 
deallylation in aqueous solution. This one-step dual- 
deallylation reaction thus allows the re- initiation of 
the polymerase reaction. The detailed synthesis 
procedure and characterization of the 4 novel 
nucleotide analogues in Fig. 2 are described in 



25 



30 



ials and Methods, 
in order to verify that these fluorescent nucleotide 
analogues are incorporated accurately in a base- 
specific manner in a polymerase reaction. four 
continuous steps of DNA extension and deallylation 
were carried out in solution. This allows the 
isolation of the DNA product at each step for detailed 
molecular structure characterization by MALDi-top mass 
spectrometry (MS) as shown in Fig. 3. The first 
extension product 5' -U(allyl-R6G) -3' -O-allyl (1) was 
purified by HPLC and analyzed using MAUJI-TOF MS [Fig. 
3(A)1. This product was then deallylated using a Pd- 
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catalyzed deallylation cocktail [ix Thermopol i 
reaction buffer/NajPdGl^/PXPhSOsNa) 3] . The active Pd 
catalyst is generated from Na,PdCl« and a ligand 
P(PhS03Na), (TPPTS) to mediate the deallylation 
5 reaction in DNA compatible aqueous condition to 
simultaneously cleave both the fluorophore and the 3'- 
O-allyl group (28). The deallylated product (2) was 
also analyzed using MALDI-TOF MS [Fig. 3(B)]. As can 
be seen from Fig. 3(A),. the maldi-top MS spectrum 
consists of a distinct peak at m/z 6469 corresponding 
to the DNA extension product 5' — U(allyl-R6G) -3' -o- 
allyl (1), which confirms that the nucleotide analogue 
can be incorporated base specifically among pool of 
all four (A, C, G, T) by DNA polymerase into a growing 
DNA strand. Fig. 3(B) shows the deallylation result of 
the above DNA product. The peak at m/z 6469 has 
completely disappeared while the peak corresponding to 
the dual deallylated product 5'-U (2) appears as the 
sole dominant peak at m/z 5870, which establishes that 
the Pd-catalyzed deallylation reaction completely 
cleaves both the fluorophore and the 3'-0-allyl group 
with high efficiency without damaging the DNA. The 
next extension reaction was carried out using this 
deallylated DNA product with a free 3'-OH group 
regenerated as a primer along with four allyl modified 
fluorescent nucleotide mixture to yield an extension 
product 5'— UG{allyl-Bodipy-650)-3'.O-allyl (3). As 
described above, the extension product 3 was analyzed 
by MALDI-TOP MS producing a dominant peak at m/z 6984 
iFig. 3(C)], and then deallylated for further MS 
analysis yielding a single peak at m/z 6256 (product 4) 
[Fig. 3(D)]. The third extension reaction yielding 
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5'~UGA(allyl-ROX)-3' -O-allyl (S) , the fourth 
extension reaction yielding 5' — UGAC(allyl-Bodipy-FL- 
510)-3' -0-allyl (7) and their deallylation reactions 
to yield products 6 and 8 were similarly carried out 
and analyzed by MALDI-TOF MS as shown in Figs. 3(E), 
3(F). 3(G) and 3(H). The chemical structures of the 
extension and cleavage products for each step are 
shown in Fig. 4. These results demonstrate that the 
above -synthesized 4 chemically cleavable fluorescent 
nucleotide analogues are successfully incorporated 
with high fidelity into the growing DNA strand in a 
polymerase reaction, and furthermore, both the 
fluorophore and the 3'-0-allyl group are efficiently 
removed by using a Pd-catalyzed deallylation reaction, 
15 which makes it feasible to use them for SBS on a chip. 

4 -Color DNA sequencing with chemically cleavable 
fluorescene nucleotide analogues aa reversible 
terminators on a DNA chip 

2 0 The chemically cleavable fluorescent nucleotide 
analogues were then used in an SBS reaction to 
identify the sequence of the DNA template immobilized 
on a solid surface, A site-specific 1,3-dipolar 
cycloaddition coupling chemistry was used to 
25 covalently immobilize the alkyne- labeled self -priming 
DNA template on the azido-f unctionalized surface in 
the presence of a Cu(I) catalyst. The principal 
advantage offered by the use of a self -priming moiety 
as compared to using separate primers and templates is 
that the covalent linkage of the primer to the 
template in the self -priming moiety prevents any 
possible dissociation of the primer from the template 
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during the process of SBS. To prevent non-specific 
absorption of the unincorporated fluorescent 
nucleotides on the surface of the chip, a PEG linker 
is introduced between the DNA templates and the chip 
5 surface (Fig. 5) . This approach was shown to produce 
very low background fluorescence after cleavage to 
remove the fluorophore as demonstrated by the DNA 
secjuencing data described below. 

10 SBS was first performed on a chip- immobilized DNA 
template that has no homopolymer sequences using the 
four chemically cleavable fluorescent nucleotide 
reversible terminators (3' -O-allyi-dCTP-ailyi-Bodipy- 
FL-510, 3' -0-aIIyl-dUTP-aJIyl-R6G, 3' -O-aJIyl-dATP- 
15 allyl-ROX and 3 ' -O-allyl-dGTP-allyl-CyS) and the 
results are shown in Fig. 6. The structure of the 
self -priming DNA moiety is shown schematically in Fig. 
6A, with the first 13 nucleotide sequences immediately 
after the priming site. The de novo sequencing 
20 reaction on the chip was initiated by extending the 
self-priming DNA using a solution containing all four 
3' -O- allyl-dNTPs-allyl -fluorophore, and a 9^ mutant 
DNA polymerase. In order to negate any lagging 
fluorescence signal that is caused by previously 
25 unextended priming strand, a synchronization step was 
added to reduce the amoxmt of vmextended priming 
strands after the extension with the fluorescent 
nucleotides. A synchronization reaction mixture 
consisting of all four 3' -O-allyl-dNTPs (Fig. 7), 
30 which have a higher polymerase incorporation 
efficiency due to the lack of a fluorophore compared 
to the bulkier 3 ' -O-ailyl-dNTPs-allyl-f luorophore, was 
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used along with the 9°N mutant DNA polymerase to extend 
any remaining priming strand that has a free 3' -OH 
group to synchronize the incorporation. The extension 
by 3' -O-allyl-dNTPs also enhances the enzymatic 
incorporation of the next nucleotide analogue, because 
after cleavage to remove the 3'-0-allyl group, the DNA 
product extended by 3 ' -O-allyl-dNTPs carry no 
modification groups. After washing, the extension of 
the primer by only the complementary fluorescent 
nucleotide was confirmed by observing a red signal 
(the emission from Cy5) in a 4-color fluorescent 
scanner [Fig. SB (1)]. After detection of the 
fluorescent signal, the chip surface was immersed in a 
deallylation cocktail [IX Thermolpol I reaction 
15 buffer/Na2PdCl4/P(PhS03Na)3] and incubated for 5 min at 
60 ""C to cleave both the fluorophore and 3'-0-allyl 
group simultaneously. The chip was then immediately 
immersed in a 3 M Tris-HCl buffer (pH 8.5) and 
incubated for 5 min at 60 to remove the Pd complex. 
The surface was then washed, and a negligible residual 
fluorescent signal was detected to confirm cleavage of 
the fluorophore. This was followed by another 
extension reaction using 3' -O-allyl-dNTPs-aJIyl- 
fluorophore mixture to incorporate the next 
25 fluorescent nucleotide complementary to the subsequent 
base on the template. The entire process of 
incorporation, synchronization, detection and cleavage 
was performed multiple times using the four chemically 
cleavable fluorescent nucleotide reversible 

30 terminators to identify 13 successive bases in the DNA 
template. The fluorescence image of the chip for each 
nucleotide addition is shown in Fig. 6B, while a plot 
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of the fluorescence intensity vs. the progress of 
sequencing extension (raw 4 -color secjuencing data) is 
shown in Fig. 6C. The DNA sequences are unambiguously 
identified from the 4 -color raw fluorescence data 
5 without any processing. 



Coinparlson of 4 -color SBS with pyrosequenclng. 

To further verify the advantage of SBS method using 
the four chemically cleavable fluorescent nucleotide 
10 reversible terminators, we carried out similar 
sequencing reaction as described above on a DNA 
template which contained two separate homopolymeric 
regions (stretch of 10 T's and 5 T's) as shown in Fig. 
8 (panel A) . These sequencing raw data were produced 
15 by adding all 4 fluorescent nucleotide reversible 
terminators together to the DNA template immobilized 
on the chip followed by synchronization reaction with 
four 3'-0-allyl-dNTPs, detecting the unique 
fluorescence emission for sequence determination, then 
cleaving the fluorophore and the 3'-0-allyl group in 
one step to continue the sequencing cycles. All 20 
bases including the individual base (A, T, C, G) , the 
10 repeated A' s and the 5 repeated A' s are clearly 
identified. The small groups of peaks between the 
2 5 identified bases are fluorescent background from the 
DNA chip, which does not build up as the cycle 
continues. Panel B in Fig. 8 shows the pyrosequenclng 
data of the same DNA template containing the 
homopolymeric sequences. The first 4 individual bases 
30 are clearly identified. The two homopolymeric regions 
(10 A's) and (5 A's) produce two large peaks, but it 
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is very difficult to identify the exact sequence from 
the data. 
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25 



Conclusion 

5 Four novel chemically cleavable fluorescent nucleotide 
analogues have been synthesized and characterized and 
have been used to produce a 4 -color de novo DMA 
sequencing data on a chip. m doing so, two critical 
requirements for using SBS method to sequence DNA have 
10 been achieved unambiguously. First, a strategy to use 
a chemically reversible moiety to cap the 3' -OH group 
of the nucleotide has been successfully implemented so 
that the nucleotide terminates the polymerase reaction 
to allow the identification of the incorporated 
nucleotide, in addition these reversible terminators 
allow for the addition of all four nucleotides 
simultaneously in performing SBS. This ultimately 
reduces the number of cycles needed to complete the 
sequencing cycle, increases sequencing accuracy due to 
20 competition of the 4 nucleotides in the polymerase 
reaction, and enables accurate determination of 
homopolymeric regions of dna template. Second, 
efficient removal of both the fluorophore and the 3'- 
OH capping group after the fluorescence signal 
detection have successfully been carried out which 
increases the overall efficiency of SBS. 



The key factor governing the sequencing readlength of 
our 4 -color SBS approach is the stepwise yield that 
30 are determined by the nucleotide incorporation 
efficiency and the yield of the cleavage of the 
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fluorophore and the 3' -OH capping group from the DNA 
extension products. This stepwise yield here is - 99% 
based on measurement of the DNA products in solution 
phase. The yield on the surface is difficult to 
5 measure precisely due to fluctuations in the 
fluorescence imaging using the current manual 
fluorescent scanner. The strong fluorescence signal 
even for the 20"* base shown in Pig. a indicates that 
we should be able to extend the readlength even 
10 further. In terms of readlength, Sanger sequencing is 
still the gold standard with readlength of over 800 bp 
but limited in throughput and cost. The readlength of 
pyrosequencing is -lOQ bp but with high error rate due 
to difficulty in accurately determining the sequences 
of homopolymers . The 4 -color SBS readlength on a 
manual fluorescent scanner is currently at - 20 bp 
with high accuracy. This readlength will increase with 
automation of the extension, cleavage and washing 
steps. The DNA polymerases and fluorescent labeling 
used in the automated 4-color Sanger sequencing method 
have undergone almost two decades of consistent 
incremental improvements after the basic fluorescent 
Sanger methods were established (2, 3). Following the 
same route, it is expected that the basic principle 
and strategy outlined in our 4-color SBS method will 
stimulate further improvement of the sequencing by 
synthesis methodology with engineering of high 
performance polymerases tailored for the cleavable 
fluorescent nucleotide terminators and testing 
30 alternative linkers and 3' -OH reversible capping 
moiety. It has been well established that using 
emulsion PCR on microbeads, millions of different DNA 



20 



25 



wo 2008/069973 



PCT/l)S2007/024646 



34 



templates are immobilized on a surface of a chip (n, 
16). This high density DMA templates coupled with our 
4-color SBS approach will generate a high- throughput 
(> 20 millions bases/chip) and high-accurate platform 
for a variety of sequencing and 
expression analysis projects. 



Materials and Methods 
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NMR spectra were recorded on a Bruker DPX-400 (400 
MHz) spectrometer and reported in ppm from a CD3OD or 
DMSO-d« internal standard (3.31 or 2.50 ppm 
respectively). Data were reported as follows: (a = 
singlet, d = doublet, t = triplet, q = quartet, m , 
multiplet. dd = doublet of doublets, ddd = doublet of 
doublets of doublets; coupling constant (s) in Hz; 
integration; assignment) . Proton decoupled "c nm^ 
spectra were recorded on a Bruker DPX-400 (100 MHz) 



HO 




Ruorophor* 



MO 



H 
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3*>0.«lly|^CTP.a||y|.aod«py^L^1 0 
3'-0-aPy*HiWTP-NM, 3'-0.«JIy|.dUT|i-.||,|.||«o 
c 9'-0-«lly|^TP4lly|.fl0X 

3*-O-«lty^daTP^II|f|«o4lpy^S0 |eyS) 

Fluorophore-Bodlpy-FL S 10: fldG; ROX; Bodlpy-OSO; CyS 

3'.0.«flyH>NTP.NH, . 3-<>'«Oy..5H3..n.lnoprop.1.y„yO-2Vd.o.vcytidIne.5-.lrtpho.pha 

« 3' 0-allyl-»H3.«mlnopfOp.l -ynyl).2'.deo»yurtd«#.5'.tflphoaph*i. 
.3^0-ally|.7.<3..mi«opropo.y„yi>.7HJe.2a-2--doo«y.denoiina.a-.|rlpl,p.ph.tt 

Synthetic scheme to Drenare ^>,o ^v,^^ • 

Prepare the chemically cleavable 

fluorescent nucleotides, a, DMF / i m NaHCOa solution; b 
iST-disuccinimidyl carbonate (DSC), triethylamine; c, o/i m 
NaHCOa / NaaCOa aqueous buffer (pH 8.7) 
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spectrometer and reported in ppm from a CD3OD. DMSO-d, 
or CDCl, internal standard (49.0. 39.5 or 77.0 ppm 
respectively). Proton decoupled "p nmr spectra were 
5 recorded on a Bruker DPX-300 (121.4 MHz) spectrometer 
and reported in ppm from an 85% H,PO« external standard 
High Resolution Mass Spectra (HRMS) were obtained on a 
JEOL JMS HX 11 OA mass spectrometer. Compounds 3 0 and 
32 were purchased from Berry & Associates. All Dye NHS 
10 esters were purchased from Molecular Probes and ge 
Healthcare. All other chemicals were purchased from 
Sigma -Aldrich. 

I. Synthesis of 3' -o-allyl-dNTPs-allyl-Fluorophore. 
15 Chemically cleavable fluorescent nucleotides 3'-o- 
allyl-dCTP-allyl-Bodipy-PL-SlO, 3 ' -O-allyl-dUTP-allyl- 
RSO, 3'-0-allyl-dATP-allyl-R0X. 3 ' -O-allyl-dGTP-allyl- 
Bodipy-650 and 3 ' -O-allyl-dGTP-allyl-CyS 

synthesized according to the above Scheme. a 
20 chemically cleavable linker 4-amino-2-methylene.i. 
butanol (allyl-linker) was reacted with the n- 
hydroxysuccinimide (NHS) ester of the corresponding 
fluorescent dye to produce an intermediate allyl- 
fluorophore, which was converted to an allyl 
5 fluorophore NHS ester by reacting with N.N'- 
disuccinimidyl carbonate. The coupling reaction 

between the different allyl -fluorophore NHS eaters and 
3'-0-allyl-dNTPs-NH, produced the four chemically 
Cleavable fluorescent nucleotides. 

1. Synthesis of allyl-linker 
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(-'Y^OH !!^E£i_"'»<^-0^~S^OH CBr«/PPh3 P^C.^^^N^p, TMSCN i TBAf 

J E.,N/CH^I, II CH^b » CH,CN.20a«n 



HO' 

62% 2 a 



J' ether II ch.oh' 




CHjOH 

65% 5 95% 6 



2-Triphenylmethoxyinethyl-2-propen-l-ol (2) . To a 
stirred solution of trityl chloride (4.05 g; 14.3 ininol) 
5 and 2-methylenepropane-l,3-diol l (1.20 mL; 14.3 nunol) 
in dry CHjClj (20 mL) , triethylamine (4.0 mL; 28.5 mmol) 
was added slowly at room temperature. The mixture was 
stirred at room temperature for 1 h, and then ethyl 
acetate (lOO mL) and saturated aqueous NaHCOa (30 mL) 
10 were added. The organic phase was washed with 
saturated aqueous NaHCOa, NaCl, and dried over 
anhydrous Na^SO, . After evaporation, the residue was 
purified by flash column chromatography using ethyl 
acetate -hexane (1:10-5) as the eluent to afford 2 as a 
15 white solid (2.89 g; 62% yield): NMR (400 MHz, CDCI3) 
S 7.42-7.48 (m. 6H, six of ArH) , 7.27-7.33 (m, 6H. 
of ArH), 7.20-7.27 (m. 3H, three of ArH), 5.26 (s, IH, 
one of C=CH2), 5.17 (s, IH, one of C-C/fe) , 4.13 (d, J I 
6.1 HZ, 2H. CH2OH), 3.70 (s. 2H, PhaCOCH,) , 1.72 (t, J 
20 = 6.1 Hz, IH, CH2OH); "C NMR (100 MHz, CDCI3) 5 145.4. 
143.6, 128.3, 127.6, 126.8, 111.6, 87.0, 65.3, 64.5. 

l-Bromo-2-triphenylmethoxymethyl.2-propene (3) . To a 
stirred solution of 2 (2.56 g; 7.74 mmol) in CH,C1, (75 
25 ml), CBr« (3.63 g; 10.83 mmol) and triphenylphosphine 
(2.47 g; 9.31 mmol) were added respectively at 0 "c. 
The mixture was stirred at room temperature for 40 min. 



sxx 
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10 



Ethyl acetate (100 mL) and saturated aqueous NaHCOj (3o 
mL) were added at 0»C. The organic phase was washed 
with saturated aqueous NaCl, and dried over anhydrous 
NaaSO*. After evaporation, the residue was purified by 
flash column chromatography using CHaClj-hexane (1:5) 
as the eluent to afford 3 as white solid (3.02 g; 92% 

yield): NMR (400 MHz, CDClj) 5 7.42-7.48 (m, 6H, six 
of ArH), 7.27-7.33 (m, 6H, six of ArH) . 7.20-7.27 (m 
3H, three of ArH), 5.37 (s, IH, one of CsCHj) , 5.31 (s, 
IH, one of C=C«i), 4.01 (s, 2H, CHaBr) , 3.75 (a, 2H. 
PhaCOCHj); "C NMR (100 MHz, CDCI3) 5 143.6, 142.6. 
128.4, 127.6, 126.9, 115.8, 86.9, 64.2, 33.5. 

3-Triphenylmethoxymethyl-3-butenonitrile (4). to a 
15 stirred solution of 3 (1.45 g; 3.69 mmol) and in dry 
CH3CN (37 mL), trimethylsilyl cyanide (0.49 mL; 3.69 
mmol) was added. Then, 1 m tetrabutylammonium fluoride 
(TBAF) in THP solution (3.69 mL, 3.69 mmol) was added 
into the above reaction mixture slowly at room 
20 temperature. The mixture was stirred for 20 min. After 
evaporation, the residue was diluted with ethyl 
acetate (lOO mL) and saturated aqueous NaHCOj (30 mL) . 
The organic phase was washed with saturated aqueous 
NaCl and dried over anhydrous Na^SO^ . After evaporation, 
25 the residue was purified by flash column' 
chromatography using acetate-hexane (l:io) as the 
eluent to afford 4 as white solid (i.oi g; 64% yield): 
^HNMR (400 MHZ. CDCI3) 5 7.39-7.45 (m, 6H, six of ArH). 
7.21-7.34 (m, 9H. nine of ArH). 5.31 (s, 2H, C=CHj) . 
iO 3.64 (s, 2H, PhjCOC/fa), 3.11 (s, 2H, CH.CN) ; »C NMR 
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(100 MHz, CDClj) 8 143.3, 135,5, 128.2, 127.7, 126.9, 
116.8, 114.7, 87.0, 65.7, 21.9. 

3-Triphenylmethoxymethyl-3-buten-l-amine (5) . To a 
5 stirred solution of LiAlH4 (119 mg; 2.98 mmol) in dry 
ether (5 mL) , AICI3 (400 mg; 2.94 mmol) was added 
slowly at 0 and the mixture was stirred for 15 min. 
The mixture of 4 (829 mg; 2.44 mmol) in dry ether (9 
mL) was added and then continued to stir at 0 ^'C for 
10 another 3h. Afterwards, 10% aqueous NaOH (10 mL) was 
added to quench the reaction. The organic phase was 
washed with saturated aqueous NaHCOj, NaCl, and dried 
over anhydrous K2CO3. After evaporation, the residue 
was further purified by flash column chromatography 
15 using CHaOH-CHaCla (1:20-5) as the eluent to afford 5 
as colorless oil (545 mg; 65% yield) : NMR (400 MHz, 
CDCI3) 5 7.41-7.48 (m, 6H, six of ArH) , 7.26-7.33 (m, 
6H, six of ArH), 7.19-7.26 (m, 3H, three of ArH), 5.33 
(s, IH, one of C^CHz) , 4.96 (s, IH, one of C=CH2) , 3.53 
20 (s, 2H, PhaCOCHa) , 2.70 (m, 2H, CH2CH2NH2) , 2.18 (t, J- = 
6.7 Hz, 2H, CH2CH2NH2), 2.06 (br s, 2H, NH2) ; "c NMR 
(100 MHz, CDCI3) 5 143.6, 143.4, 128.1, 127.4, 126.5, 
111.3, 86.5, 66.1, 39.8, 37.4; HRMS (FAB^) calcd for 
C24H26ON (M+H*) : 344.2014, found: 344.2017. 

25 

4 -Amino- 2- methylene- 1-butanol (6) . To a stirred 
solution of 5 (540 mg; 1.57 mmol) in CH3OH (11 mL) , HCl 
(2M solution in ether; 5.5 mL) was added at room 
temperature and the mixture was stirred for 1 h. Then 
30 7M ammonia in CH3OH solution (2.7 mL) was added into 
the above mixture at room temperature and continued to 
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stir for another 10 min. After filtration, the solid 
was washed with CHaOH and combined with the filtrate. 
After evaporation, the crude product was further 
purified by flash column chromatography using CH3OH- 
5 CH3CI3 (1:4) as the eluent to afford 6 as colorless oil 
(151 mg; 95% yield): NMR (400 MHz, CD3OD) 5 5-19 (s, 
IH, one of C^CHa) , 5.01 (s, IH, one of C^CHa) , 4.06 (s, 
2H, CH2OH), 3.10 (t, J" = 7.5 Hz, 2H, CH2CH2NH2) , 2.46 (t, 
J * 7.5 Hz, 2H, CH2CH2NH2); NMR (100 MHz, CDjOD) S 

10 145-3, 113.7, 65.5, 39.5, 32.0; MS (FAB+) calcd for 
CsHiaON (M+H^) : 102.09, found: 102.12. 

2. Synthesis of Allyl-Fluorophore 

A general procedure for the synthesis of Allyl- 
15 Fluorophore is as follows. To a stirred solution of 6 
(3.5 mg; 34.6 /atooI) in DMF (450 /iL) , aqueous NaHCOa (IM 
solution; 100 fili) was added at room temperature. The 
mixture was stirred for 5 min. Dye NHS {N- 
hydroxy sue cinimide) ester (5 mg) in DMF (450 fxL) was 
2 0 added and then the mixture was stirred at room 
temperature for 6 h. The crude product was further 
purified by a preparative TLC plate using CH3OH-CH2CI2 
as the eluent to afford Allyl-Fluorophore. 



25 Allyl-Bodipy-FL-510 (7). ^H NMR (400 MHz, CD3OD) 6 7.42 
(s, IH) , 7.00 (d, i7 = 4.0 Hz, IH) , 6.32 (d, iJ=4.0Hz, 
IH) , 6.21 (s, IH) , 5.06 (s, IH, one of C=CH2) , 4.87 Is, 
IH, one of C=Ci/2, partly superimposed by solvent 
signal), 4.01 (s, 2H, CH2OH) , 3.33 (t, J = 7.5 Hz, 2H, 

30 partly superimposed by solvent signal), 3.21 (t, J = 
7.7 Hz, 2H) , 2.59 (t, J = 7.7 Hz , 2H) , 2.51 (s, 3H, 
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one of ArCHj) , 2.28 (a. 3H) , 2.26 (t, J = 7.1 Hz, 2H). ; 
HRMS (FAB+) calcd for Ci9H2402N3FaB (M*) : 375.1933, found: 
375.1957 . 

5 Allyl-R6G (8). NMR (400 MHz, CD3OD) 6 8.12 (d, J » 

8.1 Hz, IH) , 8.05 (dd, J" = 1.8, 8.1 Hz, IH) , 7.66 (d, 
J = 1.6 Hz, IH) , 7.02 (s, 2H) , 6.88 (3, 2H) , 5.08 (s, 
IH, one Of C=CH2), 4.92 (s, IH, one of C-CHj) , 4.06 (s, 
2H. CHaOH) , 3.48-3.56 (m, 6H) , 2.40 (t, J « 7.2 Hz. 2H) . 
10 2.13 (S, 6H), 1.36 (t, J- - 7.2 Hz, 6H) ; HRMS (PAB+) 
calcd for CaaHsjOsNa (M+H*) : 542.2655, found: 542.2648. 

Allyl-ROX (9). *H NMR (400 MHz, CD3OD) 5 8.03 (d, J = 
8.1 Hz, IH) , 7.98 (dd, J « 1.6, 8.1 Hz, IH) , 7.60 (d, 

15 J- = 1.4 Hz, IH) , 6.75 (s, 2H) , 5.08 (s, IH. one of 
C=CJfa) » 4.91 (s, IH, one of C^CHa) , 4.05 (s, 2H, CHaOH) , 
3.45-3.57 (m, lOH) , 3.03-3.10 (m, 4H) , 2.64-2.73 (m, 
4H) , 2.38 (t, J = 7.1 Hz, 2H) , 2.04-2.15 (m, 4H) , 
1.89-1.99 (m, 4H) ; HRMS (FAB+) calcd for C38H40O5N3 

20 (M+H*): 618.2968, foxind: 618.2961. 

Allyl-Bodipy-650 (10). NMR (400 MHz, CD3OD) 6 8.12 

(dd, J - 0.9, 3.8 Hz, IH) , 7.63 (m, 3H) , 7.54 (d, J » 
6.4 HZ, 2H), 7.35 (s, IH) . 7.18-7.22 (m, 2H) , 7.12 (m, 
2H) , 7.06 (d, J = 8.8 Hz, 2H) , 6.85 (d, J" = 4.2 Hz, 
IH) , 5.06 (s, IH, one of C=CHa) , 4.86 (s, IH, one of 
C=CH2),4.56 (s, 2H) , 4.00 (s, 2H, CH3OH) , 3.28 (m, 4H) , 
2.23 (t, J = 7.1 Hz, 2H), 2.14 (t, J = 7.5 Hz, 2H) , 
1.49-1.62 (m, 4H) , 1.25-1.34 (m, 2H) ; HRMS (FAB+) 
calcd for C34H3704N4F2SB (M*) : 646.2603, found: 646.2620. 



25 



30 
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Allyl-Cy5 (11). NMR (400 MHz, CD3OD) 5 7.75-7.86 (m, 
2H), 7.43-7,62 (m, 3H) , 6.65 (m, IH) , 6.25-6.53 (m, 
5H>, 5,06 (s, IH, one of C=CHa) , 4.86 (s, IH, one of 
C^Cifa), 4.56 (s, 2H), 4.00 (s, 2H, CHaOH) , 3.28 (tn. 6H) , 
5 2.03-2.40 (m, 4H) , 1.55 (t, iJ= 7.2 Hz, 3H) , 1.31 (s, 
6H), 1.26 (s, 6H), 1.25-1.64 (m, 6H) ; HRMS (FAB+) 
calcd for C38H4BO8N3S2 (M*) : 738.2888, found: 738.2867. 

3. Synthesis of Allyl-Fluorophore NHS ester 
10 A general procedure for the synthesis of Allyl-Dye NHS 
ester is as follows. To a stirred solution of Allyl- 
Fluorophore (4 mg> in dry DMF (350 /iL) , DSC (8.0 mg; 
31.2 pLtnol) and triethylamine (5.0 ^L; 3 5.4 /imol) were 
added respectively. The reaction mixture was stirred 
15 at room temperature for 10 h. After evaporation, the 
crude product was further purified by flash column 
chromatography using CH3OH-CH2CI2 as the eluent to 
afford Allyl-Fluorophore NHS ester, which was used 
directly for the next step. 

20 

4. Synthesis of 3' -O-allyl-dCTP-allyl-Bodipy-FL-SlO 
(23) 
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OM OH O-s^ 

IT It 1* ^ 

JJ J J T 1)POCI,/PO(OM.>, 

HO-xO^IJl"'^ ^ NHCOCF, ^q^^o^M"^ 2) (ButNMUPjOy 




6^^^ E»jNiDMF 0^^^ 4)NH,(aQ) 



20 21 



^ ^ ^ ^ fy^ta^ a«yl.eOOIPY.FL.510NMS«.lef 



/V/W'V 0«F/N.HCO,.N^O, 

O 
22 



0. 0^^0.^0 

o ao o*o O" 




3'-O>aUyMCTP-alfyl-Bodipy-FL-5l0 23 



5' -O- ( tex-t-Butyldimethylsilyl) -5-iodo-2' -deoxycytidine 
(18). To a stirred mixture of 17 {1.00 g; 2,83 nrniol) 
5 and imidazole (462 mg; 6.79 mmol) in anhydrous DMF 
(14.0 ml,), tert-butyldimethylsilyl chloride (TBDMSCl) 
(510 tng; 3.28 mmol) was added. The reaction mixture 
was stirred at room temperature for 20 h. After 
evaporation, the residue was purified by flash column 
10 chromatography using CH3OH-CH2CI2 (1:20) as the eluent 
to afford 18 as white solid (1.18 g; 89% yield): NMR 
(400 MHz, CD3OD) 6 8.18 (s, IH, 6-H) , 6.17 (dd, 5.8, 
7.5 Hz, IH, I'-H), 4.34 (m, IH, 3'-H), 4.04 (m, IH, 
4'-H), 3.93 (dd, 4J = 2.5, 11.6 Hz, IH, one of 5'-H), 
15 3,84 (dd, J- = 2.9, 11.6 Hz, IH, one of 5'-H), 2.41- 
2.48 (ddd, J- = 2,5, 5.8, 13.5 Hz, IH, one of 2'-H), 
2.01-2.08 (ddd, il = 5.9, 7.6, 13.5 Hz, IH, one of 2'- 
H) , 0.95 (s, 9H, C(CH3)3), 0.17 (s, 3H, one of SiCHa) , 
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0.16 (s, 3H, one of SiCHa) ; ^^C NMR (100 MHz, CD3OD) 8 
165.5, 156.8, 147.8, 89.4, 88.3, 72.8, 64.6, 57.1, 
43.1, 26.7, 19.4, -4.8, -4.9; HRMS (FAB+) calcd for 
CisH2704N3SiI (M+H*) : 468.0816, found: 468.0835. 

5 

3' -O-Allyl-5' -O- ( tert-butyldimethylsilyl) -5-icdo-2' - 
deoxycytidine (19). To a stirred solution of 18 (1.18 
g; 2.52 mmol) in anhydrous THF (43 rtiL) , 95% NaH powder 
(128 mg; 5.07inmol) was added. The suspension mixture 
10 was stirred at room temperature for 45 min. Ally! 
bromide (240 p.L, 2.79 mmol) was then added at 0 «C and 
the reaction was stirred at room temperature for 
another 14 h with exclusion of moisture. Saturated 
aqueous NaHCOa (10 mL) was added at 0 «C and the 
15 reaction mixture was stirred for 10 min. After 
evaporation, the residue was dissolved in ethyl 
acetate (150 mL) . The solution was then washed with 
saturated aqueous NaHCOa and NaCl, and dried over 
anhydrous Na2S04 . After evaporation, the residue was 
20 purified by flash column chromatography using ethyl 
acetate as the eluent to afford 19 as white solid (601 
mg; 47% yield): NMR (400 MHz, CD3OD) 5 8.15 (s, IH, 

6-H), 6.12 (dd, J- = 5.6, 8.0 Hz, IH, 1' -H) , 4.17 (m, 
IH, 4'-H), 4.14 (m, IH, 3'-H), 3.98-4.10 (m, 2H, 
25 CHaCH^CHa), 3.93 (dd, J ^ 2.8, 11.5 Hz, IH, one of 5' - 
H) , 3.83 (dd, J- = 2.8, 11.5 Hz, IH, one of 5'-H), 
2.53-2.60 (ddd, J * 1.7, 5.6, 13.6 Hz, IH, one of 2'- 
H), 1.94-2.02 (ddd, J = 5.9, 8.0. 13.6 Hz, IH, one of 
2'-H), 0.94 (s, 9H. C(CH3)3), 0.17 (s, 3H, one of 
30 SiCHa), 0.16 (s, 3H, one of SiCHa) ; "c NMR (100 MHz, 
CD3OD) 5 165.4, 156.7, 147.7, 135.5, 117.2, 88.2, 87.0, 
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80.4, 70.9, 64.8, 57.3, 40.1, 26.7, 19.4, -4.7, -4.9; 
HRMS (FAB+) calcd for Ci8H3i04N3SiI (M+H^^) : 508.1129, 
found: 508.1123. 

5 3' -0-Allyl-5-iodo-2' -deoxycytidine (20). To a stirred 
solution of 19 (601 mg; 1.18 mmol) in anhydrous THP 
(28 mL) , 1 M TBAF in THF solution (1.31 mL; 1.31 mmol) 
was added and the reaction mixture was stirred at room 
temperature for 1 h. After evaporation, the residue 
10 was dissolved in ethyl acetate (100 mL) . The solution 
was then washed with saturated aqueous MaCl and dried 
over anhydrous NaaSO^. After evaporation, the residue 
was purified by flash column chromatography using 
CH3OH-CH2CI2 (1:10) as the eluent to afford 20 as white 
15 crystals (329 mg; 71% yield) : NMR (400 MHz, CD3OD) 5 
8.47 (s, IH, 6-H), 6.15 (dd, i7= 6.2, 6,7 Hz, IH, 1'- 
H), 5.87-5.98 (m, IH, CHaCH^^CHa) , 5.26-5.33 (dm, J = 
17.2 Hz, IH, one of CHaCH^CHa) , 5.14-5.19 (dm, J" = 10.5 
Hz, IH, one of CHaCH^CHa) , 4.18 (m, IH, 3'-H), 4.08 (m, 
20 IH, 4'-H), 3.98-4.10 (m, 2H, CHaCH=CHa) , 3.82 (dd, 

3.2, 13.0 Hz, IH, one of 5'-H), 3.72 (dd, J = 3.3, 
13.0 Hz, IH, one of 5'-H), 2.44-2.51 (ddd, J « 3.2, 
6.0, 13.6 Hz, IH, one of 2'-H), 2.07-2.15 (m, IH, one 
of 2'-H); "C NMR (100 MHz, CD3OD) 5 165.4, 156.9, 148.8, 
25 135.6, 117.0, 87.9, 86.9, 79.6, 71.2, 62.7, 57.2, 39.7; 
HRMS (FAB+) calcd for C12H17O4N3I (M+H'") : 394.0264, found: 
394.0274 • 

3' -O-Allyl-5- {3- [ (trifluoroacetyl) amino] prop- 1-ynyl } - 
30 2' -deoxycytidine (21). To a stirred solution of 20 
(329 mg; 0.84 mmol) in anhydrous DMF (3.7 mL) , 
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tetrakis(triphenylphosphine)palladium(0) (97 mg; 0.084 
mmol) and Cul (35 mg; 0.18 mmol) were added. The 
reaction mixture vras stirred at room temperature for 
10 man. Then i»-propargyltrif luoroacetamide (379 mg; 
5 2.51 mmol) and EtjN (233 ^L; 1.68 mmol) were added into 
the above reaction mixture. The reaction was stirred 
at room temperature for 1.5 h with exclusion of air 
and light. After evaporation. the residue was 
dissolved in ethyl acetate (loo mL) . The mixture was 
10 washed with saturated aqueous NaHCO,, NaCl, and dried 
over anhydrous Na^SO*. After evaporation, the residue 
was purified by flash column chromatography using 
CH30H-CH,C1, (0-1:10) as the eluent to afford 21 as 
yellow crystals (290 mg; 83% yield) : NMR (400 MHz, 
15 CD3OD) 5 8.31 (s. IH, 6-H). 6.17 (dd. ^ = 6.0. 7.3 Hz, 
IH^ I'-H). 5.87-5.97 (m. IH. CH,CH=CH,) . 5.26-5.33 (dm. 
J= 17.3 Hz. IH. one of CH^CH^CHa) . 5.15-5.19 (dm. J = 
10.4 HZ. IH, one of CHaCH«C«a), 4.31 (s. 2H. CsCCif,) , 
4.17 (m. IH, 3'-H). 4.09 (m, IH, 4'-H). 3.98-4.10 (m 
^0 2H, CH,CH=CH.), 3.80 (dd, ^ - 3.4, 12.0 Hz, IH, one of 
^'~^^» 3.72 (dd, «7 s 3 6 15 n tr» im ^ 

' J.o, Hz, IH, one of 5'-h), 

2.46-2.53 (ddd, »7=29 s^i^£u 

. u 5.3^ 13.6 Hz, IH, one of 2'- 

H). 2.04-2.12 (m. IH, one of 2'-H); "c NMR (100 MHz 
CD3OD) 5 166.0, 158.4 (q. J . 38 Hz, CDCF3) , 156. 3' 
5 145.8, 135.6. 117.1 (q. jr , hz. COCP3) . 117.0. 91.9 

90.7, 88.0. 87.0. 79.8, 75.5, 71.2. 62.8. 39.6. 31. 0-' 
HRMS (FAB*) calcd for CH.oOsN.P, (M*H*) : 417.1386. 
found: 417.1377. 



30 3' -O-Allyl-5- (3 -aminoprop-l-ynyl ) -2 ' -deoxycytidine-5 ' - 
triphosphate (22). 21 (133 mg; 0.319 mmol) and proton 
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sponge (83.6 mg; 0.386 mmol) were dried in a vacuum 
desiccator over P2O5 overnight before dissolving in 
trimethylphosphate (0.60 mL) . Freshly distilled POCI3 
(36 /iL; 0.383 mmol) was added dropwise at 0 ®C and the 
5 mixture was stirred for 3 h. Then the solution of 
tributylammonium pyrophosphate (607 mg) and 
tributylamine (0.61 mL; 2.56 mmol) in anhydrous DMF 
(2.56 mL) was well vortexed and added in one portion 
at room temperature cuid the reaction mixture was 
10 stirred for 30 min. After that tri ethyl ammonium 
bicarbonate solution (TEAS) (0.1 M; 16 mL) was added 
and the mixture was stirred for 1 h. Then aqueous 
ammonia (28%; 16 mL) was added and the reaction 
mixture was stirred for 12 h. After most liquid was 
15 removed under vacuum, the residue was redissolved in 
water (2 mL) and filtered. The aqueous solution was 
purified by DEAE Sephadex A25 ion exchange column 
using gradient aqueous TEAS solution (from 0.1 M to 
1.0 M) as eluent to afford 22 as colorless syrup after 
20 evaporation: NMR (300 MHz, D2O) 5 8.43 (s. IH, 6-H) . 
6.21 (t, J » 6.7 Hz, IH, I'-H), 5.85-6.00 (m, IH, 
CH2CH=CH2), 5.28-5.38 (dm, a « 17.3 Hz, IH, one of 
CHaCH^CHi) , 5.19-5.27 (dm, J = 10.4 Hz, IH, one of 
CHaCH^CHj), 4.22-4.41 (m, 3H, 3' -H and CsCCi/2) , 4.05- 
25 4.18 (m, 3H, 4 ' -H and CHaCH^CHa) , 3.94-4.01 (m, 2H, 5' - 
H), 2.47-2.59 (m, IH, one of 2'-H), 2.20-2.32 (m, IH, 
one of 2'-H); ^^P NMR (121.4 MHz, D2O) 5-7.1 (d, J" 

19.8 Hz, IP, /.p), -11.1 (d, jr = 19,1 Hz, IP, a-P), - 

21.9 (t, J = 19.5 Hz, IP, >9-P) . 
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3' -O-Allyl-dCTP-allyl-Bodipy-FL-510 (22). To a stirred 
solution of allyl-Bodipy-FL-510 NHS ester 12 in DMF 
(300 tih) , 3'-0-allyl-dCTP-NHa 22 in IM NaHCOa-NajCOa 
buffer (300 /iL; pH 8.7) was added. The reaction 
5 mixture was stirred at room temperature for 7 h and 
the crude product was purified by a preparative TLC 
plate using CH3OH-CH2CI2 (1:1) as the eluent. The crude 
product was further purified by reverse -phase HPLC 
using a 150x4.6 mm 018 column to afford compound 23 
10 (retention time « 35 min) . Mobile phase: A, 4.4 mM EtjN 
/ 98-3 mM 1, 1, 1 , 3, 3 , 3-hexaf luoro-2-propanol in water 
(pH = 8.1); B, methanol. Elution was performed from 
100% A isocratic over 10 min followed by a linear 
gradient of 0-50% B for 20 min and then 50% B 
15 isocratic over 20 min. The product was characterized 
by the following single base extension reaction to 
generate DNA extension product 24 and characterized by 
MftUDI-TOF MS. 



2 0 A general procedure of primer extension using 3'-o- 
allyl-dNTPs-allyl-Pluorophore. The polymerase 

extension reaction mixture consisted of 50 pmol of 
primer (5' -GTTGATGTACACATTGTCAA-3 ' ) , 80 pmol of 100- 
"^^^ template (5#« 

2 5 TACCCGGAGGCCAAGTACGGCGGGTACGTCCTTGACAATGTGTACATCAACATC 
ACCTACCACCATGTCAGTCTCGGTTGGATCCTCTATTGTGTCCGGG-3' ) , 
120 pmol of 3'-0-allyl-dNTP-allyl-Fluorophore, ix 
Thermopol II reaction buffer [20 mM Tris-HCl/ 
10 mM (NH4)2SO4/10 mM KCl/ 2 mM MnCla/O.l % Triton X-100, 
pH 8.8, New England Biolabs] , and 6 units of 9° n 
Polymerase (exo- ) A485L/y409V in a total volume of 20 
fill. The reaction consisted of 2 0 cycles at 94 '^c for 
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20 sec, 46 »C for 40 sec, and 60 *C for 90 sec. After 
the reaction/' a small portion of the DNA extension 
product was desalted using a ZipTip and analyzed by 
MALDI-TOF MS, which showed a dominant peak at m/z 5935 
5 corresponding to the DNA extension product generated 
by incorporating 23 . The rest of the product was 
subjected to the following deallylation. 

General one-pot dual -deallylation procedure of DNA 
10 extension products. The DNA product from above (20 
pmol) was added to a mixture of degassed IX Thermopol 
I reaction buffer (20 mM Tris-HCl/lO mM (NH4)2SO4/10 mM 
KCl/2 mM MgSO4/0.1 % Triton X-100, pH 8.8, 1 nL) , 
NaaPdCl* in degassed HjO (7 |iL, 23 nmol) and P(PhS03Na)3 
15 in degassed HjO (10 ^iL, 176 nmol) to perform an one-pot 
dual -deallylation reaction. The reaction mixture was 
then placed in a heating block and incubated at 70 
for 3 0 seconds to yield quantitatively deallylated DNA 
product and characterized by MALDI-TOF MS as a single 
20 peeJc. 
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£xi«n8ion with ^-O-aJiyl-dNTP-ayy^Oye on a mpaal G tamptata: 



Ptimm S'-TCTCTGGCCGCGTGTCT 



-y 23. 27. at. 43 5'- 



GGAA -5' 9»>SiDMApoJyn»«M 3'. 



Dnt^aon 3*- 



•C'-OH -3* 



(2) 




C«ied: 



OH 



Caledb 
S48S 



-C-aHyl -3* 
iGAA -5- 




5 



5. Synthesis of 3 ' -O-allyl-dUTP-allyl-RSG (27) 



10 



15 




HN 

d'o-o'WV \^ OMFyNaHCOj-NaaCO, 



O OrO OX> Or J-^ 




3'-aalfyl-dUTP.aflyM6G 77 



Synthesis of 3 ' -O-allvl-HUTP-Tow 

A±±yj, auTP-NHz 26 was performed 

according to the procedures in reference (28) , 



was 



3'-0-Allyl.dUTP-allyl-R6G (27). The procedure 
similar to the synthesis of 23. The product wal 
characterized by the single base extension reaction 
and MALDI-TOF MS in Fig. 9. 
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6. Synthesis of 3' -O-allyl-dATP-allyl-ROX (39) 




a 



I TolO 32 
1)KOM/CHjCN 



2)NH>/CH,OH 
31 71% 



W^** TBDMSa 



ON 
33 



TBAB 



CHjCb/NaOHfaq) 
9S% 



i)TBAF/TMF 




NHCOCF9 



O 

as 



OH 
34 



^f^NMCOCF, 



Pd(PPh,)4/C«l 
ElaN / OMF 



l}POCl9/PO(OM«)^ 

3) 8u,N/TEAa 

4) NNs(aq) 



36 

6/. 



90% 



OyM^X NHS 



d' 



f 



I 



'0*_^0»^^0^^^0 




. ' 



ir 

o 



3'-0-ariyl-dATP.ally|.R0X 39 



5 6-Chloro-7-iodo-7-dea2apurine (31). To a vigorously 
stirred solution of 30 (l.o g; 6.51 mmol) in CH.Cl, (55 
mD, W-iodosuccimide (1.70 g; 7. 18 mmol) was added. 
The suspension mixture was stirred at room temperature 
for 1 h. during which more precipitate was observed. 
10 The precipitate was filtered and then recrystallized 
in hot methanol to afford 31 (1.49 g; 82% yield): »H 
NMR (400 MHz, DMSO-d6) 5 12.96 (br s, IH, NH) , 8.59 (s. 
IH, 2-H). 7.94 (s, IH, 8-H) ; NMR (100 MHz. DMSO-d6) 

5 151.2, 150.4. 150.2. 133.6, 115.5, 51.7; HRMS (FAB*) 
15 calcd for CeH.N,ClI (M^H*) : 279.9139. found: 279.9141. 
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6-Chloro-9- (^-D-2' -deoxyribofuranosyl) -7-iodo-7- 
deazapurine (33). To a stirred solution of 31 (707 mg; 
2.53 minol) in CHjCN (43 mL) , KOH powder (355 mg; 6.34 
nanol) and tris [2- (2-methoxyethoxy) ethyl] amine (TDA-1) 
(52 ML, 0.165 mmol) were added. The mixture was stirred 
at room temperature for 10 min and then 3,5-di-o-(p- 
toluyl) -2-deoxy-D-ribofuranosyl chloride 32 (i.ia g; 
2.73 mmol) was added. The reaction mixture was stirred 
vigorously at room temperature for 1 h, and the 
insoluble portion was filtered and washed with hot 
acetone. The combined solution was evaporated and 
dissolved in 7M ammonia in methanol solution (86 mL) . 
The mixture was stirred at room temperature for 24 h. 
After evaporation, the crude product was purified by 
flash column chromatography using CHjOH-CHjClj (0-1:20) 
as the eluent to afford 33 as white solid (711 mg; 71% 
yield): NMR (400 MHz. CD3OD) 5 8.57 (s, IH, 2-H) . 

8.08 (s, IH, 8-H). 6.72 (dd, J» 6.3, 7.5 Hz, IH, 1'- 
H), 4.53 (m, IH, 3'-H), 4.00 (m, IH, 4'-H). 3.80 (dd, 
J- = 3.6, 12.0 Hz, IH. one of S'-H), 3.74 (dd, J= 3.6,' 
12.0 HZ, IH, one of 5'-H). 2.56-2.64 (ddd, j = e.i, 
7.5, 13.5 H2, IH, one of 2'-H), 2.36-2.43 (ddd, J = 
3.3, 6.2. 13.5 Hz, IH, one of 2'-H); "c NMR (loo MHz, 
CD3OD) 5 152.9. 151.7. 151.3, 134.7. 118.5. 89.0. 85.7.' 
72.6. 63.2. 52.6. 41.7; HRMS (FAB+) calcd for 
CixH^OaNaClI (M+H*) : 395.9612. found: 395.9607. 

9- [fi-D-S' -O- (tejTt-Butyldimethylsilyl) -2' - 
30 deoxyribof uranosyl J -6-chloro- 7-iodo7-deazapurine (34) . 
The procedure was similar to the synthesis of is. 
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5 



10 



15 



20 



25 



30 



the crude product was purified by flash colunm 
chromatography using ethyl acetate-hexane (1:3-2) as 
the eluent to afford 34 as white solid (597 mg; 65% 
yield) and 33 (213 mg; 3 0% yield) . The above procedure 
was repeated with the recovered 33 to achieve a 86% 
overall yield of 34: NMR (400 MHz, CD3OD) 6 8 . 56 (s, 
IH, 2-H), 7.99 (s, IH. 8-H) , 6.73 ('t', J- 6.7 Hz, IH, 
I'-H), 4.52 (m, IH, 3'-H), 4.02 (m, IH, 4'-H), 3.92 
(dd, J = 3.0, 11.4 Hz, IH, one of 5'-H), 3.86 (dd, J = 
3.1, 11.4 Hz, IH, one of 5'-H), 2.47-2.55 (ddd, J" = 
5.8, 7.1, 13.4 Hz, IH, one of 2' -H) , 2.40-2.47 (ddd. J" 
" 3.6, 6.3, 13.4 Hz, IH, one of 2'-H), 0.94 (s, 9H, 
C(C«3)3). 0.14 (s, 3H, one of SiCH,) , 0.13 (s, 3H, one 
of SiCJ/3); "C NMR (100 MHz, CD3OD) 5 152.8, 151.5, 
151.3, 133.8, 11.8.2, 88.9, 85.4, 72.5, 64.6, 52.6, 
42.4. 26.7, 19.5, -4.9, -5.0; HRMS (FAB+) calcd for 
Ci7H2603N3ClSiI (M+H*) : 510.0477, found: 510.0487. 

9- [^-D-3' -0-Allyl-5'-o-(tert-butyldimethylsilyl)-2'- 
deoxyribofuranosyl] -6-chloro-7-iodo-7- deazapurine 
(35). To a stirred solution of 34 (789 mg; 1.55 mmol) 
in CHaCla (48 mL) . tetrabutylammoniura bromide (TBAB) 
(255 mg; 0.77 mmol). allyl bromide (0.69 mL, 7.74 mmol) 
and 40% aqueous NaOH solution (24 mL) were added 
respectively. The reaction mixture was stirred at room 
temperature for ih. Ethyl acetate (150 mL) was added 
and the organic layer was separated. The aqueous layer 
was extracted with ethyl acetate (2x50 mL) . The 
combined organic layer was washed with saturated 
aqueous NaHCOj, NaCl, and dried over anhydrous Na^so,. 
After evaporation, the residue was purified by flash 
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column chromatography using ethyl acetate- hexane (1:6) 
as the eluent to afford 35 as yellow oil (809 mg; 95% 
yield): NMR (400 MHz, CDjOD) S 8.52 (s, IH. 2-H) , 

7.94 (s, IH, 8-H), 6.64 (dd, J = 6.1, 7.6 Hz, IH, 1'' 
H), 5.88-5.99 (m, IH, CHjCH^CH,) , 5.28-5.34 (dm, J- = 
17.3 Hz, IH, one of CH2CH=CH2) , 5.16-5.21 (dm, J - 10. 4 
HZ, IH, one of CH,CH=C//,) , 4.28 (m, IH. 3'-H), 4.13 (m, 

IH, 4'-H), 4.01-4.11 (m, 2H, Ci/aCH»CH,) , 3.88 (dd, J", 

3.6, 11,2 Hz, IH, one of 5'-H), 3.80 (dd, J = 3.1, 

II. 3 Hz, IH, one of 5'-H), 2.51-2.57 (ddd, J - 2.7, 
6.0, 13.5 Hz, IH, one of 2'-H), 2.42-2.50 (ddd, J » 

5.7, 7.7, 13.5 Hz, IH, one of 2'-H), 0.93 (s. 9H, 
C(Ci/3)3), 0.13 (s, 3H, one of SiCHj) , 0.12 (s. 3H, one 

of SiCHa); "C NMR (100 MHz, CD3OD) 5 152.8, 151.4, 
151.3, 135.5. 133.6, 118.2. 117.2, 86.5, 85.6, 80.2. 
71.0. 64.8, 52.8, 39.7, 26.7, 19.4, -4.8, -5.0; HRMS 
(FAB+) calcd for CaoHaoOaNaClSil (M+H*) : 550.0790. found: 



550 . 0773 . 



20 3'-0-Allyl-7-deaza-7-iodo-2'.deoxyadenosine (36). To a 
stirred solution of 35 (809 mg.- 1.47 ramol) in 
anhydrous THF (34 mL) . i m TBAF in THF solution (1.62 
mL; 1.62 mmol) was added and the reaction was stirred 
at room temperature for i h. After evaporation, the 

IS residue was dissolved in 7M ammonia in methanol 
solution (24 mL). The solution was stirred in an 
autoclave at 115-120 for 15 h. After evaporation, 

the residue was purified by flash column 
chromatography using CHjOH-CHaCl, (1:20) as the eluent 

0 to afford 36 as white solid (514 mg; 84% yield) : ^H NMR 
(400 MHZ. CD3OD) 5 8.08 (s. IH. 2-H), 7.56 (s. IH. 8-H) 
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6.45 (dd. J= 5.8, 8.6Hz. IH. I'-H). 5.90-6.00 (m. IH. 
CH,Ci/=CH,). 5.29-5.35 (dm, J » 17.2 Hz, IH, one of 
CH,CH-CH,), 5.16-5.21 (dm, J = 10.5 Hz. IH, one of 
CH,CH=Cjy,) . 4.28 (m. IH. 3'-H), 4.12 (m. IH. 4'-H). 
4.02-4.12 (m, 2H. CHjCH^CHa). 3.78 (dd. 3.7. 12.1 

HZ. IH, one of 5'-H), 3.70 (dd, j" = 3.6. 12.1 Hz, IH. 
one of S'-H). 2.53-2.61 (ddd. j = 5.8. 8.6. 13.6 Hz, 
IH. one of 2'-H). 2.41-2.47 (ddd, J = 2.0, 5.8. 13.5 
HZ. IH. one of 2'-H); "c NMR (100 MHz, CDsOD) 6 158.5, 
152.3. 150.3. 135.7. 128.8. 117.0. 105.3. 86.8. 86.4.' 
80.7. 71.0, 63.7, 51.3. 38.8; HRMS (PAB+) calcd for 
C14H1BO3N4I (M+H*) : 417.0424, found: 417.0438. 

3' -0-Allyl-7-deaza-7- {3- [ (trif luoroacetyl) aminoj prop- 
15 l-ynyl}-2'-deoxyadenosine (37). The procedure was 
similar to the synthesis of 21. and the crude product 
was purified by flash column chromatography using 
ethyl acetate-hexane (l:i>o) as the eluent to afford 
37 as yellow solid (486 mg; 90% yield) : ^h NMR (400 MHz. 
CD3OD) 5 8.08 (s. IH, 2-H). 7.60 (s. IH, 8-H) , 6.41 (dd! 
-J = 5.8, 8.6 Hz, IH, I'-H), 5.89-6.00 (m, IH 
CH.CH=CH,), 5.29-5.35 (dm, J , 17.3 Hz, IH, one of 
CH3CH=Ci/,), 5.16-5.21 (dm. J = 10.4 Hz. IH. one of 
CH,CH=CH,), 4.31 (s, 2H. C^CH,) , 4.29 (m. IH. 3 • -H) , 
4.13 (m, IH. 4'-H), 4.01-4.11 (m. 2H. CH,CH=CHa) . 3.79 
(dd, J- = 3.6, 12.1 HZ, IH. one of 5'-H), 3.71 (dd, J , 
3.5, 12.1 Hz, IH, one of 5' -H) , 2.54-2.62 (ddd. J = 
5.8. 8.6. 13.6 HZ. IH. one of 2'-H). 2.42-2.48 (ddd. J 
» 1.9. 5.8. 13.6 Hz. IH. one of 2'-H); "c NMR (100 MHz 
CD3OD) 5 158.8. 158.6 (q, a = 38 Hz, COCF3) . 152.9 ' 
149.6. 135.6. 128.1. 117.1 (q, j . 334 Hz. COCF3) 
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117.0, 104.5, 96.3, 87.3, 86.9, 86.8, 80.7. 77.0, 71,0, 
63.8, 38.7, 31.1; HRMS (FAB+) calcd for Ci9H2i04N5Fa 
(M+H'") : 440.1546, foxrnd: 440.1544. 

5 3' -O-Allyl-7- (3-aminoprop- 1-ynyl ) -7-deaza-2' - 

deoxyadenosine-5' -triphosphate (38). The procedure was 
similar to the synthesis of 22 to yield 38 as 
colorless syrup: NMR (3 00 MHz, D2O) 5 8.02 (s, IH, 
2-H) , 7.89 (s, IH, 8-H), 6.54 (t, Cf^ 6.6 Hz, IH, 1'- 

10 H) , 5.89-6.02 (m, IH, CHaCif^CHa) , 5.30-5.39 (dm, J" = 
17.3 Hz, IH, one of CH2CH=CH2) , 5.20-5.27 (dm, J ^ 10.4 
Hz, IH, one of CHaCH^Cft) , 4.48 (s, 2H, CsCCifa) , 4.35 (m, 
IH, 3'-H), 4.05-4.17 (m, 4H. CHaCH=CHa and S'-H), 3.99 
(m, IH, 4'-H), 2.50-2.59 (m, 2H, 2'-H); ^^P NMR (121.4 

15 MHz, D2O) 5 -6.1 (d, iJ = 21.1 Hz, IP, r-P) r -10.8 (d, J 
- 18.8 Hz, IP, a-P), -21.9 (t, iJ = 19.9 Hz, IP, /J-P) . 

3' -O-Allyl-dATP-allyl-ROX (39). The procedure was 
similar to the synthesis of 23. The product was 
20 characterized by single base extension reaction and 
MALDI-TOF MS. See Fig. 10. 

7. Synthesis of 3' -O-allyl-dGTP-allyl-Bodipy-650 (43) 
and 3' -O-allyl-dGTP-allyl-CyS (44) 
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HH 



•OylMDlPY-aM NHS MMr 



■o, 
o 



42 ^""^ 




•aal»yMGTP-allyl-aadipy-650 43 



so, 



or- 



OMF I MaHCOs -Na^CO, 




3'-0-a«yl-dGTP>aUyl^y5 44 



Synthesis of 3' -0-allyl-dGTP-NH2 42 was performed 
according to the procedures in reference (29) . 

5 

3' -O-Allyl-dGTP-allyl-Bodipy-650 (43) and 3' -O-ailyl- 
dGTP-allyl-Cy5 (44) • The procedures were similar to 
the synthesis of 23 . The product was characterized by 
single base extension reaction and MALDI-TOF MS. See 
10 Fig. 11. 



II. Synthesis of 3 ' -O-allyl-dNTPs 



15 1. Synthesis of 3'-0-allyl dCTP (51) 



wo 2008/069973 



PCT/US2007/024646 




S' -O- ( tejTt-Butyldimethylailyl) -2 ' -deoxycytidine (48) . 
To a stirred solution of 2' -deoxycytidine 47 (l.oo g; 
5 4.40 mmol) in dry pyridine (37 mL) , TBDMSCI (814 mg; 
5.39 mmol) was added. The mixture was stirred at room 
temperature for 20 h. After evaporation, the residue 
was purified by flash column chromatography using 
CH3OH-CH2CI2 (1:10) as the eluent to afford 48 as white 

10 solid (1.26 g; 84% yield): NMR (400 MHz, CD3OD) S 

8.03 (d, J = 7.5 H2, IH, 6-H), 6.23 (t, J = 6.3 Hz, IH, 
I'-H), 5.86 (d, J- = 7.5 Hz. IH, S-H) , 4.35 (m, IH, 3'- 
H) , 3.91-3.98 (m, 2H, 4'-H and one of 5' -H) , 3.85 (dd, 
*7 = 2.5, 11.3 Hz, IH, one of 5'-H), 2.36-2.43 (ddd. J 

15 = 4.1, 6.1, 13.5 Hz, IH, one of 2'-H), 2.05-2.13 (m, 
IH, one of 2'-H), 0.94 (s, 9H, C(CJf3)3), 0.14 (s, 3H, 
one of SiCHi) . 0.13 (s, 3H, one of SiCHj) ; "c NMR (100 
MHZ, CD3OD) 5 167.1, 157.6, 142.1, 95.6, 88.7, 87.4, 
71.7, 64.1, 42.7, 26.5, 19.3, -5.2, -5.3; HRMS (FAB+) 

20 calcd for CisHasO^NjSi (M+H*) : 342.1849, found: 342.1844. 



3' -O-Allyl-5' -O- (tert-butyldimethylsilyl) -2' - 
deoxycytidine (49) . The procedure was similar to the 
synthesis of 19 and the crude product was purified by 
25 flash column chromatography using CHaOH-CHjClj (1:20) 
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as the eluent to afford 49 as yellow solid (480 mg; 
35% yield): NMR (400 MHz, CD3OD) 3 7.97 (d, J" = 7.5 
Hz, IH, 6-H), 6.21 (t, J= 6.5 Hz, IH, I'-H), 5.87 (d, 
J = 7.5 Hz, IH, S-H) , 5.87-^.97 (m, IH, CHaCJfeCHa) , 
5 5.26-5.33 (dm, J" = 17.2 Hz, IH, one of CHaCH^CHa) , 
5.15-5.20 (dm, J = 10.5 Hz, IH, one of CHaCHsCifa) , 4.16 
(m, IH, 3'-H), 4.11 (m, IH, 4'-H), 3.97-4.11 (m, 2H, 
CHaCH=CHa) , 3.92 (dd, jr » 3.2, 11.4 Hz, IH, one of 5'- 
H) , 3.84 (dd, J- » 2.8, 11.4 Hz, IH, one of 5' -H) , 
10 2.46-2.51 (ddd, a = 3.1, 5.9, 13.6 Hz, IH, one of 2'- 
H) , 2.00-2.08 (m, IH, one of 2'-H), 0.94 (s, 9H, 
C(CH3)3), 0,13 (s, 6H, Si(CH3)a); "CNMR (100 MHz, CD3OD) 
6 167.2, 157.7, 141.9, 135. 6, 117.1, 95.7, 87.5. 86.6, 
79.7, 71.1, 64.4, 39.8, 26.5, 19.3, -5.1, -5.2; HRMS 
15 (FAB+) calcd for Ca5H2«04N3Si (M+H*) : 342.1849. found: 
342.1844. 



3' -O-Allyl-2' -deoxycytidine (50). The procedure was 
similar to the synthesis of 20 and the crude product 

20 was purified by flash column chromatography using 
CH3OH-THF (1:12) and CHjOH-ethyl acetate (1:4) as the 
eluent to afford 50 as white foam (269 mg; 80% yield) : 
'H NMR (400 MHZ, CD3OD) 5 7.96 (d, J = 7.5 Hz, IH. 6-H), 
6.21 (dd, J » 5.9, 7.6 H2, IH, I'-H), 5.89 (d, J = 7.5 

25 Hz, IH, 5-H), 5.87-5.98 (ra, IH, CHaCH^CHa) , 5 . 27 - 5 . 33 
(dm, J- = 17.3 Hz. IH, one of CH2CH=CHa) , 5.15-5.19 (dm, 
J- = 10.4 HZ, IH, one of CHaCH=CHa) , 4.17 (m, IH, 3'-H), 
3.98-4.10 (m, 3H, 4'-H and CH2CH=CH2) , 3.77 (dd, J = 
3.6, 12.0 Hz, IH, one of 5'-H). 3.71 (dd. jr = 3.7, 
30 12.0 Hz, IH, one of 5' -H) , 2.43-2.50 (ddd, J" = 2.7. 
5.9, 13.6 H2, IH, one of 2'-H), 2.03-2.11 (ddd, J = 
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6.2. 7.7, 13.6 HZ, IH, one of 2'-H); "c NMR (100 MHz. 
CD3OD) 6 167.1, 157.7. 142.2. 135.5, 117.0. 96.0. 87.5,' 
86.6, 80.0, 71.0. 63.0, 39.1; HRMS (FAB+) calcd for 
CijHia04N3 (M+H*) : 268.1297, foxond: 268.1307. 

3' -O-Allyl.2' -deoxycytidine-5' -triphosphate (51) . 50 
(86 mg; 0.32 tnmol) and proton sponge (84 mg; 0.39 mmol) 
were dried in a vacuum desiccator over P.Os overnight 
before dissolving, in trimethylphosphate (0.60 mL) . 
Freshly distilled POCl, (35.8 mL; 0.383 mmol) was added 
dropwise at 0 -c and the mixture was stirred for 3 h. 
Then the solution of tributylammonium pyrophosphate 
(607 mg) and tributylamine (0.61 mL; 2.56 mmol) in 
anhydrous DMF (2.6 mL) was well vortexed and added in 
one portion at room temperature and the reaction was 
stirred for 3 0 min. After that tri ethyl ammonium 
bicarbonate solution (teab) (0.1 M; 16 mL) was added 
and the mixture was stirred for 2 h. After most liquid 
was removed under vacuum, the residue was redisaolved 
in water (2 mL) and filtered. The aqueous solution was 
purified by DEAE Sephadex A25 ion exchange column 
using gradient aqueous TEAB solution (from 0.1 m to 
1.0 M) as eluent to afford 51 as colorless syrup after 
evaporation: NMR (300 MHz, D,0) 6 7.90 (d. J = 7.4 
Hz, IH, 6-H). 6.20 (dd. J = 5.9. 7.6 Hz. IH, I'-H), 
5.92 (d, J = 7.4 Hz. IH. 5-H). 5.85-5.97 (m, IH 
CH.Ci/-CH.). 5.25-5.34 (m, IH, one of CH,CH=CHa) . 5.15- 
5.20 (m. IH. one of CH,CH=CHa) , 4.15 (m. IH. 3'-H) 
3.96-4.10 (m, 3H, 4 ' -H and CH,CH=CHa) . 3.70-3.80 (m. 2H. 
5'-H), 2.43-2.52 (m. IH. one of 2'-H). 2.05-2.14 (m. ' 
IH. one of 2'-H); "p NMR (121.4 MHz. D,0) 5 -8.8 (d, j 



wo 2008/069973 PCT/US2007/024646 

60 



5 



10 



= 19.0 Hz, IP, y-P) , -11.3 (d, J= 19.6 Hz, IP, a-P) , - 
22.9 (t. J= 19.5 Hz, IP, ^-P) . 



2. Synthesis of 3 ' -O-allyl-dTTP (53) 



o o 

J 1)POCl3/PO(OMe), "7 J 




Synthesis of 3 ' -O-allyl thymidine 52 was performed 
according to the procedures in reference (28) . 



3' -O-AllylthYmidine-5' -triphosphate (53) . The 

procedure was similar to the synthesis of 51 to yield 
53 as colorless syrup: ^H NMR (300 MHz, DjO) S 7.80 (m, 
IH, 6-H) , 6.23 (dd, J « 6.2, 8.1 Hz, IH, I'-H), 5.85- 

15 5.97 (m, IH, CHaCH^CHa) , 5.25-5.32 (m, IH, one of 
CH2CH= CHj ) , 5-15-5.21 (m, IH, one of CHaCH^CHj) , 4.17 (m, 
IH, 3'-H), 3.97-4.11 (m, 3H, 4 ' -H and CHzCH^CHa) , 3.70- 
3.80 (m, 2H, 5' -H) , 2.30-2.41 (m, IH, one of 2' -H) , 
2,11-2.23 (ra, IH, one of 2' -H) , 1.86 (d, J" = 1.2 Hz, 

20 3H, CH3) ; "P NMR (121.4 MHz, D2O) 5 -7.1 (d, cT ^ 20.1 
Hz, IP, y-P), -10.8 (d, J= 19.5 Hz, IP, a-P), -21.8 (t, 
*7 = 19.5 Hz, IP, fi'P) . 



3. Synthesis of 3 ' -O-allyl-dATP (59) 

25 
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^^-i^N HC(OM»,NM», TBDMSa i JL^^ ^ifi^^ TBAB 

^ lir 72% 

OH OH OH 

S4 55 55 

/ f{ \^OJ 2)NH,/CH,0M \^ 3)Bu,N/7EAB ^/V jPv j.V \rOV 

57 55 59^^ 

iy/^- t (Dimethylamino) methylene) -2' -deoxyadenosine (55) . 
To a stirred solution of 2 ' -deoxyadenosine monohydrate 
5 54 (1.00 g; 3.71 mmol) in methanol (43 mL) , N^N- 
dimethyl formamide dimethyl acetal (2-48 mL; 18.6 mmol) 
was added. The reaction was stirred at 50 <»C for 16 h. 
After evaporation, CHaClj-hexane (1:1) was added. The 
white solid formed was then filtered, collected and 
washed by hexane to afford 55 as white solid (1.13 g; 
99% yield): NMR (400 MHz, CD3OD) 5 8.92 (s, IH, 

CfiN(CH3)2), 8.44 (s, IH, 2-H> , 8-43 (s, IH, 8-H) , 6.48 
(dd, Cr « 6.2, 7.8 Hz, IH, I'-H), 4.59 (m, IH, 3'-H), 
4.07 (m, IH, 4'-H), 3.86 (dd, = 3.1, 12.2 Hz, IH, 

one of 5'-H), 3.76 (dd, = 3.5, 12.2 Hz, IH, one of 
5'-H), 3.25 (s, 3H, one of NCH3) , 3.24 (s, 3H, one of 
NCH3), 2.80-2.88 (ddd, J = 5.9, 7.8, 13.5 Hz, IH, one 
of 2'-H), 2.40-2.47 (ddd, J = 2.8, 6.1, 13.4 Hz, IH, 
one of 2'-H); "c NMR (100 MHz, CD3OD) 5 161.0, 159.9, 
152.8, 151.1, 142.8, 127.0, 89.7, 86.9, 72.9, 63.5, 
41.5 (N(CH3)2), 35.3; HRMS (FAB+) calcd for Ci3Hi903Nfi 
(M+H*) : 307.1519, found: 307.1511. 
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5' -O- ( tert-Butyldimethylailyl) -w*- 

{ (dimethylamino) methylene] -2' -deoxyadenosine (56) . The 
procedure was similar to the synthesis of 18, and the 
crude product was purified by flash column 
chromatography using CHjOH-CHjCl, (1:20) as the eluent 
to afford 56 as white foam (l . ii g. 72% yield): NMR 
(400 MHZ, CD3OD) 5 8.90 (s, IH. CW ICH^) ^) , 8.45 (s, IH. 
2-H), 8.43 (s, IH. 8-H), 6.49 (t, J= 6.5 Hz, IH. 1'- 
H), 4.59 (m. IH, 3'-H). 4.04 (m, IH, 4'-H), 3.95 (dd, 
J ^ 3.7, 11.3 Hz, IH, one of 5'-H), 3.85 (dd, J - 2.8, 
11.3 Hz, IH, one of S'-H), 3.25 (s. 3H, one of NCf/3) ' 
3.24 (s, 3H. one of NCH,) , 2.73-2.81 (m, IH, one of 2'- 
H). 2.48-2.55 (ddd, J = 4.O, 6.4, 13.5 Hz, IH, one of 
2'-H), 0.90 (s, 9H, C(CH3)3), 0.08 (s, 3H, one of 
15 SiCH,), 0.07 (s, 3H, one of SiCH,) ; "C NMR (100 MHz, 
CD3OD) 5 160.6. 159.7, 153.0, 151.7, 141.8, 126.5, 88.9, 
85.7, 72.1. 64.3, 41.6. 41.5. 35.3, 26.5, 19.3, -5.0, 
-5.1; HRMS (FAB+) calcd for Ci,H3303N6Si (M+H*) : 421.2383. 
found: 421.2390. 
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3' - O-Allyl-5' -O- ( t-butyldimethylsilyl) -N^- 
l (dimethylamino) methylene] -2 ' -deoxyadenosine (57) . The 
procedure was similar to the synthesis of 35, and the 
crude product was purified by flash column 
chromatography using CH3OH-CH3CI3 (1:25) and CH30H-ethyl 
acetate (l:io) as the eluent to afford 57 as colorless 
oil (875 mg; 72% yield): ^H NMR (400 MHz, CD3OD) 5 8.90 
(s, IH, CiiN(CH3)3), 8.44 (s, IH, 2-H), 8.41 (s, IH, 8- 
H), 6.45 (dd, J- = 6.3, 7.2 Hz. IH, I'-H). 5.91-6.01 (m 
IH. CH,C//=CH,). 5.30-5.37 (dm. J . 17.2 Hz. IH. one of' 
CH.CH=Cif,). 5.18-5.22 (dm. J = 10.5 Hz. IH. one of 
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CHaCH«CHa), 4.37 (m, IH, 3'-H), 4.17 (m, IH, 4'-H), 
4.05-4.15 (m, 2H, Ci/aCHsCHa) , 3.91 (dd, J - 4.6, 11.1 
Hz, IH, one of 5'-H). 3.83 (dd, J = 3.8, ll.l Hz, IH, 
one of 5'-H), 3.25 (s, 3H, one of NCHj) , 3.24 (s, 3H, 
5 one of NCHa) , 2.76-2.83 (ddd, J = 6.0, 7.3, 13.6 Hz, IH, 
one of 2'-H), 2.59-2.65 (ddd, J" = 3.0, 6.1. 13.6 Hz, 
IH. one of 2'-H), 0.90 (a, 9H, CiCH^)^), 0.08 (s, 6H, 
Si (0/^3)2); "C NMR (100 MHz, CD3OD) 5 160.7, 159.7, 
153.1, 151.8, 141.9, 135.6, 126.5, 117.1, 86.7, 85.9, 
10 80.1, 71.1, 64.5, 41.5, 38.7, 35.3, 26.5, 19.3, -5.0, 
-5.1; HRMS (FAB+) calcd for CjaHjiO^NsSi (M+H*) : 461.2696, 
found: 461.2695. 

3' -O-Allyl-2' -deoxyadenosine (58). To a stirred 
15 solution of 57 (875 mg; 1.90 mmolj in anhydrous THP 
(45 itiL) , 1 M TBAF in THF solution (2.09 mL; 2.09 mmol) 
was added and the reaction was stirred at room 
temperature for l h. After evaporation, the residue 
was dissolved in 7 M ammonia in methanol solution (34 
20 mL) . The mixture was then stirred in a sealed flask at 
50 for 9 h. After evaporation, the residue was 

purified by flash column chromatography using CH3OH- 
CH2CI2 (1:10) as the eluent to afford 58 as white solid 
(548 mg; 99% yield): NMR (400 MHz, CD3OD) 6 8,30 (s, 
25 IH, 2-H), 8.17 (s, IH, 8-H) , 6.38 (dd, J ^ 5.8. 8-6 Hz, 
IH, I'-H), 5.91-6.01 (m, IH, CH2CH=CH2) , 5.30-5.37 (dm, 
J ^ 17.3 Hz, IH, one of CHaCH^CHa) , 5.17-5.22 (dm, a ^ 
10.6 Hz, IH, one of CHzCH^Cifz) , 4.36 (m, IH, 3'-H), 
4.21 (m, IH, 4'-H), 4.04-4.15 (m, 2H, CH2CH=CH2) , 3.85 
30 (dd, jr = 3.2, 12.3 Hz, IH, one of 5'-H), 3.74 (dd, J" = 
3.2, 12.3 Hz, IH, one of 5' -H) , 2.75-2.83 (ddd, jr = 
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5.7, 8.6, 13.6 Hz, IH, one of 2'-H), 2.52-2.58 (ddd, J 
= 1.8, 5.8, 13.6 Hz, IH, one of 2'-H); "C NMR (100 MHz. 
CD3OD) 6 157.1, 153.1, 149.5, 141.2, 135.6, 120.6, 
117.0, 87.5, 87.2, 80.9, 71.0, 63.9, 38.7; HRMS (FAB+) 
5 calcd for daHieOaNs (M+H*) : 292.1410, found: 292.1426. 

3' -O-Allyl-2' -deoxyadenosine-5' -triphosphate (59). The 
procedure was similar to the synthesis of 51 to yield 
59 as colorless syrup: ^H NMR (300 MHz, D3O) 5 8.46 (3, 
IH, 2-H), 8.19 (s, IH, 8-H), 6.43 (dd, J - 6.3, 7.2 Hz. 
IH, I'-H), 5.90-6.02 (m, IH, CHaCH-CHj) , 5.31-5.40 (dm, 
J = 17,1 Hz, IH, one of CHjCHsCiia) , 5.21-5.28 (dm, J » 
10.8 HZ, IH, one of CHaCH«CHa) , 4.55 (m, IH, 3'-H) , 
4,40 (m, IH, 4'-H). 4.06-4.20 (m, 4H, Ci/aCHaCHj and 5' - 
15 H), 2.61-2.82 (m, 2H, 2' -H) ; "p NMR (121.4 MHz, D2O) 5 
-8.9 (d, J- = 19.1 Hz, IP, /-P), -11.2 (d. J= 19.7 Hz, 
IP, a-P). -22.8 (t, J = 19.9 Hz. IP, fi-P) . 



10 



20 



4. Synthesis of 3 ' -O-allyl-dGTP (65) 



9 a 
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2-Ainino-6-chloro-9- [fi-D-5' -O- ( tert-butyldimethylsilyl) - 
2' -deoxyribofuranosyl] purine (61). The procedure was 
similar to the synthesis of 18, and the crude product 
was purified by flash column chromatography using 
5 CHjOH-CHaCl2 (1:20) as the eluent to afford 61 as white 
solid (1.20 g; 86% yield): NMR (400 MHz, CD3OD) S 
8.25 (s, IH, 8-H), 6.34 (t, J" = 6.4 Hz, IH, I'-H), 
4.56 (m, IH, 3'-H), 4.01 (m, IH, 4'-H), 3.90 (dd, J= 
3.5, 11.4 H2, IH, one of 5' -H) , 3.84 (dd, J = 3.8,- 
10 11.4 Hz, IH, one of 5'-H), 2.67-2.74 (m, IH, one of 
2'-H), 2.43-2.50 (ddd, J = 4.2, 6.4, 13.5 Hz, IH, one 
of 2'-H), 0.89 (s, 9H, 0(0^3)3), 0.07 (s, 3H, one of 
SiCHi) . 0.06 (s, 3H, one of SiC»,) ; "c NMR (100 MHz, 
CD3OD) 5 161.1, 154.2, 151.2, 142.0, 124.9, 88.9, 85.5, 
15 72.0, 64.3, 41.4, 26.5, 19.3, -5.1 (two SiCHs) ; HRMS 
(FAB+) calcd for Ci«Ha703N5ClSi (M+H*) : 400.1572, found: 
400 . 1561. 



2-Amino-6-chloro-9- ifi-D-3' -O-allyl-5' -O- ( tert- 
20 butyl dimethyls ilyl) -2 ' -deoxyribo-f uranosyl] -purine 

(62). The procedure was the same as that of 35, and 
the crude product 61 converted from 60 was purified by 
flash column chromatography using ethyl acetate-hexane 
(1:2) as the eluent to afford 62 as white solid (832 

25 mg; 63% yield): ^H NMR (400 MHz, CD3OD) 5 8.23 (s, IH, 
8-H), 6.30 (t, J- = 6.7 Hz, IH, 1' -H) , 5.89-5.99 (m, IH, 
CHaCJfcCHa) , 5.28-5.35 (dm, J - 17.3 Hz, IH, one of 
CHzCH-CHa) , 5.16-5.21 (dm, J = 10.5 Hz, IH. one of 
CHjCH=C«2> , 4.33 (m, IH, 3'-H), 4.13 (m, IH, 4'-H), 

30 4.03-4.12 (m, 2H, CHjCH^CHa) , 3.86 (dd, J" = 4.3, 11.2 
Hz, IH, one of 5'-H). 3.81 (dd, J = 3.9, 11.2 Hz. IH, 
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one of 5'-H), 2.68-2.75 (tn, IH, one of 2' -H) , 2.53- 
2.59 (ddd, J = 3.2. 6.2, 13.6 Hz, IH, one of 2' -H) , 
0.88 (s, 9H, C(CH3)3), 0.08 (s. 3H, one of SiCH,) , 0.07 
(s, 3H, one of Sic//,); "c NMR (100 MHz, CD3OD) 5 161.1, 
154.2. 151.2. 141.9. 135.5, 124.9, 117.1. 86.7. 85.6' 
80.0. 71.1, 64.5, 38.7, 26.5. 19.3. -5.1. -5.2; HRMS 
(FAB+) calcd for Ci,H3x03N5ClSi (M+H*) : 440.1885. found: 



440.1870. 
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20 
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3' -O-Allyl-5' -O- (tert-butyldimethylsilyl) -2' - 
deoxyguanosine (63). To a stirred suspension of 95% 
Nan power (223 mg; 8.83 mmol) in anhydrous THF (82 mL) , 
3-hydroxypropionitrile (550 nL; 8.00 mmol) was added' 
and the mixture was stirred at room temperature for 20 
min. Then 62 (832 mg; 1.89 mmol) in anhydrous THF (20 
mL) was added and the mixture was stirred at 40 "C for 
1 h. At room temperature, 80% acetic acid (630 ^L; 
8.83mmol) was added and stirred for 20 min. After 
evaporation, ethyl acetate (loo mL) was added. The 
mixture was washed by saturated aqueous NaHC03, NaCl. 
and dried over anhydrous Na,SO. . After evaporation, the 
residue was purified by flash column chromatography 
using CH30H-CH,C1, (1:20) as the eluent to afford 63 as 
white solid (661 mg; 83% yield): ^H NMR (400 MHz. CD3OD) 
5 7.92 (s. IH. a-H), 6.22 (dd. 6.4. 7.3 Hz, IH, 1'- 

H). 5.89-5.99 (m, IH, CH^CH-CH,) . 5.29-5.35 (dm, J , 
17.3 HZ. IH, one of CH,CH=CH2), 5.17-5.21 (dm, J = 10 5 
HZ. IH. one of CH,CH=CH,) , 4.30 (m. IH, 3' -H) . 4.11 (m 
IH. 4'-H). 4.03-4.12 (m. 2H. ClfeCH^CH^) . 3.79-3.86 (m' 
2H. 5'-H). 2.56-2.64 (ddd. J= 5.9. 7.4, 13.5 Hz. IH 
one Of 2'-H), 2.49-2.55 (ddd, J = 3.0, 6.1. 13.5 Hz' 
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IH, one of 2'-H), 0.91 l8, 9H, C(CH3)3), 0.10 (s, 3H, 
one of SxCHi) , 0.09 (s, 3H, one of SiCHj) ; "c NMR (100 
MHz, CDCI3) 8 158.7, 153.4, 151.0, 135.3, 134.1, 117.2, 
117.1, 85.0, 83.8, 78.7, 70.2, 63.3. 38.2, 26.1, 18.5, 
5 -5.1, -5.3; HRMS (FAB+) calcd for Ci9H3204N5Si (M+H"") : 
422.2224, found: 422.2209. 

3' -O-Allyl-2' -deoxyguanosine (64). The procedure was 
similar to the synthesis of 20 and the crude product 
10 was purified by flash colunui chromatography using 
CH3OH-CH2CI2 (1:10) as the eluent to afford 64 as white 
solid (434 mg; 90% yield) : NMR (400 MHz, CD3OD) S 

7.94 (s, IH, 8-H), 6.22 (dd, J-= 5.9, 8.4 Hz, IH, 1'- 
H) , 5.90-6.00 (m, IH, CHaCH^CHj) , 5.29-5.36 (dm, J = 
15 17.2 Hz, IH, one of CHaCH^Cift) , 5.17-5.21 (dm, J- = 10.5 
Hz, IH, one of CHjCH^CHa) , 4.31 (m, IH, 3* -H) , 4.14 (m, 
IH, 4'-H), 4.03-4.13 (m, 2H, CifeCH^CHa) , 3.80 (dd, 
3.8, 12.0 Hz, IH, one of 5' -H) , 3.72 (dd, J = 3.7, 
12.0 Hz, IH, one of 5'-H), 2.63-2.71 (ddd, J" ^ 5.9, 
20 8.4, 13.6 Hz, IH, one of 2'-H), 2.45-2.52 (ddd, J" « 
2.1, 5.9, 13.6 Hz, IH, one of 2'-H); ^^C NMR (100 MHz, 
DMSO-d6) 5 156.4, 153.4, 150.6, 135.1, 134.8, 116.5, 
116.3, 84.9, 82.6, 79.1, 69.0, 61.8, 36.4; HRMS (FAB+) 
calcd for Ci^HxaO^H^s (M+H^) : 308.1359, found: 308.1358. 

25 

3' -O-Allyl-2' -deoxyguanosine-5' - triphosphate (65) . The 
procedure was similar to the synthesis of 51 to yield 
65 as colorless syrup: ^H NMR (300 MHz, D2O) 5 7.90 (s, 
IH, 8-H), 6.21 (dd, J- = 6.1, 8.1 Hz, IH, I'-H), 5.86- 
30 5.96 (m, IH, CH2Ci/=CH2) , 5.27-5.35 (m, IH, one of 
CH2CH=CH2) , 5.15-5.20 (m, IH, one of CHjCH^CHj) , 4.30 (m 



wo 2008/069973 



PCT/US2007/024646 



68 



IH, 3'-H), 4.15 (m, IH, 4'-H), 4.02-4.14 (m, 2H, 
Ci/aCH^CHj), 3.75-3.85 (m, 2H, 5'-H), 2.60-2.73 (m, IH, 
one of 2'-H), 2.42-2.50 (m, IH, one of 2'-H); "P NMR 
(121.4 MHz, D2O) 5 -10.9 (d, J = 18.9 Hz, IP, y-^) . - 
5 11.3 (d, J ^ 19.6 H2, IP, a-P), -22.9 (t, J" = 19.6 Hz, 
IP, /?-P) . 



10 



30 



III. Construction of a Chip with Immobilized Self- 
priming DNA Template. 

The DNA chip was constructed as shown in Fig. 5 and 
involved the following three steps: 



SynthBsia af the ^Ikyne-functxonalized DNA template. 
The 5' -amino-hairpin DNA templates (GeneLink, NY) 5'- 
15 NH2-TTT-TTG-TTT-TTT-TTT-TCG-ATC-GAC-TTA-AGG-CGC-TTG- 

CGC-CTT-AAG-TCG-3 ' and 5' -NH2-AGT-CAG-TCT-CTC-ATC-TCG- 
ACA-TCT-ACG-CTA- CTC-GTC-GAT-CGG-AAA-CAG-CTA-TGA-CCA- 
TGC-TTG-CAT-GGT-CAT-AGC-TGT-TTC-C-3' were coupled with 
6-heptynoic acid by adding 300 DMSO solution of 6- 
20 heptynoic-NHS ester [succinimidyl N- (6-heptynoate) ] 
(0.8 M) into the 1000 /xL DNA template solution (200 fiM, 
in 0.25 M NasCOa/NaHCOa buffer, pH 9.0). The reaction 
mixture was stirred for 5 h at room temperature to 
produce a tezrminal alkynyl group on the 5' -end of the 
25 hairpin DNA. The resulting alkyne-f unctionalized DNA 
was separated from the excess reagent by size- 
exclusion chromatography using PD-lO columns (GE 
Health, NJ) . Further desalting with an oligonucleotide 
purification cartridge (Applied Biosystems, CA) and 
drying afforded the crude product which was further 
purified by reverse-phase HPLC (Waters HPLC system 
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containing Waters Delta 600 controller, Rheodyne 7725i 
injector and 2996 photodiode array detector, Milford, 
MA) using a C-18 reverse column (Xterra MS C18, 4.6 nuii 
X 50 mm. 2. S Mm) at a flow rate of 0.5 mL/min, with 
detection at 260 nm. and elution with a linear 
gradient of 12-34.5% of B in A over 40 min (A: 4.4 mM 
triethylamine and lOO mM hexaf luoroisopropyl alcohol 
aqueous solution. pH 8.1; B: methanol). The fractions 
containing the desired product were collected and 
evaporated to dryness under vacuum. maldi-tof MS was 
used to characterize the product on a Voyager de 
matrix assisted laser spectrometer (Applied Biosystems. 
CA) using 3-hydroxypicolinic acid as a matrix. 
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15 AzidB functionalizatian of an amine -inodified glass 
surface. The amine -modified glass slide (Corning® gaps 
II) ^as cleaned and pre-treated by immersion into a 
basic solution [N.N-diisopropyl ethylamine 

(DIPEA)/dimethylformamide (DMF) . i:9 (v/V) ] for 30 min. 
The glass slide was then washed with DMF, and 
transferred into the 2 mL DMF coupling solution 
containing loo mM O- (2-a2idoethyl) -O' - [2- (diglycolyl- 
amino) -ethyl] heptaethylene glycol (Fluka. Switzerland) 

Benzotriazole-l-yl-oxy-tris-pyrrolidino-phosphonium 
hexafluorophosphate (PyBOP) (Novabiochem, CA) and 200 
mM DIPEA. The reaction vessel was gently shaJcen for 4 
h at room temperature. The azide functionalized glass 
slide was washed thoroughly with DMF and ethanol. and 
then dried under argon gas stream. 

DNA iarmohilizatlon on the azlde-nu^dified glass surface 
usxng 1,3-dipolar alkyne-azide cyaloaddition chemistry. 
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A coupling mixture was prepared by mixing tetraJcis- 
(acetonitrile) copper (I) hexaf luorophosphate (2 
mM/DMSO) , tris- (benzyltriazolylmethyl) amine (TBTA) (2 
mM/DMSO) , sodium ascorbate (2.6 mM/HaO) and alkynyl DNA 
5 (50 jiM/HjO) with a volumetric ratio of 3:3:2-3:3. This 
coupling mixture was then spotted onto the azide- 
functionalized glass slide in the form of 11. 0 /iL 
drops with the aid of adhesive silicone isolators 
(Grace Bio-Labs, OR) to create uniform spots on the 
10 glass surface. The DNA spotted glass slide was 
incubated in a humid chamber at room temperature for a 
h, then washed with de- ionized water and SPSC buffer 
(50 mM sodium phosphate/1 M NaCl, pH 7.5) for 1/2 h to 
remove non- specif ically bound DNA, and finally rinsed 
15 with dHaO. The formation of a stable hairpin was 
ascertained by covering the DNA spots with ix 
Thertnolpol II reaction buffer (10 mM KCl, 10 mM 
(NH4)2S04, 20 mM Tris-HCl, 0.1% Triton X-lOO, 4 mM MnClj, 
pH 8.8), incubating it in a humid chamber at 95 for 
5 min to dissociate any partial hairpin structure, and 
then cooling slowly for re- annealing. 



20 



IV. Continuous DNA polymerase reaction using four 
chemically cleavable fluorescent nucleotides as 
25 reversible terminators in solution. 

We characterized the four nucleotide analogues 3'-.o- 
allyl-dCTP-aJ-Zyl-Bodipy-FL-SlO, 3 ' -O-aJIyl-dUTP-aJJyJ- 
R6G, 3'-0-aIIyl.dATP-aJlyI-R0X and 3' -O-allyl-dGTP- 
aliy2.Bodipy-650, by performing four continuous DNA- 
extension reactions sequentially using a primer (5'- 
AGAGGATCCAACCGAGAC-3') and a synthetic DNA template 
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(5' - 

GTGTACATCAACATCACCTACCACCATGTCAGTCTCGGTTGGATCCTCTATTGT 

GTCCGG-3') based on a portion of exon 7 of the human 
p53 gene. The four nucleotides in the template 
5 immediately adjacent to the annealing site of the 
primer are 3'-ACTG-5'. First, a polymerase extension 
reaction using a pool of all four nucleotide analogues 
along with the primer and the template was performed 
producing a single base extension product. The 
10 reaction mixture for this, and all sxibsequent 
extension reactions, consisted of 80 pmol of template, 
50 pmol of primer, 100 pmol of 3' -O-allyl-dNTPs-allyl- 
fluorophore, IX Thermopol II reaction buffer, 4 0 nmol 
of Mn and 2 U of 9"^ mutant DNA polymerase (exo-) 
15 A485L/Y409V in a total volume of 2 0 jiL. The reaction 
consisted of 20 cycles at 94°C for 20 sec, 48*^0 for 40 
sec, and 62*'C for 90 sec. Subsequently, the extension 
product, was purified by using reverse-phase HPLC. The 
fraction containing the desired DNA product was 
20 collected and freeze -dried for analysis using MALDI- 
TOF mass spectrometry. For deallylation, the purified 
DNA extension product bearing the fluorescent 
nucleotide analogue was resuspended in degassed water 
and added to a deallylation cocktail [ix Thermopol I 
25 reaction buf f er/Na2PdCl4/P (PhSOjNa) 3I and incubated for 
3 0 s to yield deallylated DNA product which was 
characterized by MALDI-TOF MS. The DNA product with 
both the fluorophore and the 3'-0-allyl group removed 
to generate a free 3' -OH group was used as a primer 
for a second extension reaction using 3'-o-aIIyl- 
dNTPs-allyl-fluorophore. The second extended DNA 
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product was then purified by HPLC and deallylated. 
The third and the fourth extensions were carried out 
in a similar manner using the previously extended and 
deallylated product as the primer. 

V. 4 -Color SBS reaction on a chip with four chemically 
cleavable fluorescent nucleotides as reversible 
terminators . 

Ten microliters of a solution consisting of 3'-o- 
allyl-dCTP-allyl-Bodipy-FL-BlO (3 pmol) , 3' -O-allyl- 
dUTP -a2iyI-R6G (10 pmol), 3 ' -O-allyJ-dATP-aJIyJ-ROX (5 
pmol) and 3 ' -O-aiiyl-dGTP-allyl-CyS (2 pmol), 1 U of 
9*^ mutant DNA polymerase, and IX Therraolpol ll 
reaction buffer was spotted on the surface of the chip, 
15 where the self -primed DNA moiety was immpbilized. The 
nucleotide analogue complementary to the DNA template 
was allowed to incorporate into the primer at 68*'C for 
10 min. To synchronize any unincorporated templates, 
an extension solution consisting of 30 pmol each of 
3'-0-allyl-dCTP, 3 ' -O-allyl-dTTP, 3 ' -O-allyl-dATP and 
3'-0-allyl-dGTP, 1 U of 9^ mutant DNA polymerase, and 
IX Thermolpol II reaction buffer was spotted on the 
same spot and incubated at sa^'c for 10 min. After 
washing the chip with a SPSC buffer containing 0.1% 
25 Tween 20 for 5 min, the surface was rinsed with dHjO, 
dried briefly and then scanned with a 4-color 
ScanArray Express scanner (Perkin^Elmer Life Sciences) 
to detect the fluorescence signal. The 4-color scanner 
is equipped with four lasers with excitation 
wavelengths of 488, 543, 594, and 633 nm and emission 
filters centered at 522, 570, 614, and 670 nm. For 
deallylation, the chip was immersed in a deallylation 
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cocktail [IX Thertnolpol I reaction 

buffer/Na2PdCl4/P(PhS03Na)3] and incubated for 5 min at 
60 "^C. The chip was then immediately immersed in a 3 M 
Tris-HCl buffer (pH 8.5) and incubated for 5 min at 60 
5 ^'C. Finally, the chip was rinsed with acetonitrile/dH20 
(1:1, V/V) and dH20. The chip surface was scanned again 
to compare the intensity of fluorescence after 
deallylation with the original fluorescence intensity. 
This process was followed by the next polymerase 
10 extension reaction using 3 ' -O-allyl-dNTPs-allyl- 
fluorophore and 3' -O-allyl-dNTPs, with the subsequent 
washing, fluorescence detection, and deallylation 
processes performed as described above. The same cycle 
was repeated multiple times using the four chemically 
15 cleavable fluorescent nucleotide mixture in polymerase 
extension reaction to obtain de novo DNA sequencing 
data on various different DNA templates. 
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(b) 
(c) 



What is claimed is: 

1. A method for determining the sequence of a DNA 
comprising performing the following steps for each 
residue of the DNA to be sequenced: 

(a) contacting the DNA with a DNA polymerase in the 
presence of (i) a primer and (ii) four nucleotide 
analogues under conditions permitting the DNA 
polymerase to catalyze DNA synthesis, wherein (i) 
the nucleotide analogues consist of an analogue 
of dOTP, an analogue of dCTP, an analogue of dTTP 
or dUTP, and an analogue of dATP, (2) each 
nucleotide analogue comprises (i) a base selected 
from the group consisting of adenine, guanine, 
cytosine, thymine or uracil, and analogues 
thereof. (ii) a deoxyribose. (iii) a moiety 
cleavably linked to the 3' -oxygen of the 
deoxyribose and (iv) a unique label cleavably 
linked to the base, so that a nucleotide analogue 
complementary to the residue being sequenced is 
incorporated into the DNA by the DNA polymerase, 
and (3) each of the four analogues has a unique 
label which is different than the unique labels 
of the other three analogues; 
removing unbound nucleotide analogues; 
again contacting the DNA with a DNA polymerase in 
the presence of (i) a primer and (ii) four 
reversible terminators under conditions 

permitting the DNA polymerase to catalyze DNA 
synthesis, wherein (l) the reversible terminators 
consist of an analogue of dGTP, an analogue of 
dCTP, an analogue of dTTP or dUTP, and an 
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analogue of dATP, (2) each nucleotide analogue 
comprises (i) a base selected from the group 
consisting of adenine, guanine, cytosine, thymine 
or uracil, and analogues thereof, which base does 
not have a unique label bound thereto, (ii) a 
deoxyribose, and (iii) a moiety cleavably linked 
to the 3' -oxygen of the deoxyribose; 

(d) removing unbound reversible terminators; 

(e) determining the identity of the nucleotide 
analogue incorporated in step (a) via determining 
the identity of the corresponding unique label, 
with the proviso that step (e) can either precede 
step (c) or follow step (d) ; and 

(f ) following step (e) , except with respect to the 
final DNA residue to be sequenced, cleaving from 
the incorporated nucleotide analogues the unique 
label, if applicable, and the moiety linked to 
the 3 '-oxygen atom of the deoxyribose, 

thereby determining the sequence of the DNA. 

2. The method of claim 1, wherein step (e) is performed 
before step (c) . 



3 , 



4 . 



The method of claim 1. wherein the moiety cleavably 
linked to the 3 '-oxygen of the deoxyribose is 
chemically cleavable or photocleavable . 

The method of claim 1, wherein the moiety cleavably 
linked to the 3' -oxygen of the deoxyribose in the 
nucleotide analogs of step (a) is an allyl moiety or a 
2-nitrobenzyl moiety. 
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5. The method of claim 1. wherein the moiety cleavably 
linked to the 3' -oxygen of the deoxyribose in the 
reversible terminators of step (c) is an allyl moiety 
or a 2-nitrobenzyl moiety. 



6. 



7. 



The method of claim 1. wherein the lanique label is 
bound to the base via a chemically cleavable or 
photocleavable linker. 

The method of claim l. wherein the unique label bound 
to the base via a cleavable linker is a dye. a 
fluorophore, a chromophore, a combinatorial 
fluorescence energy transfer tag, a mass tag, or an 
elect rophore . 



a. The method of claim 3. wherein the moiety i 
chemically cleavable with Na^PdCl*/? (PhSOaNa) 3 . 



s 



9. The method of claim 6. wherein the linker is 
chemically cleavable with Na2PdCl4/P (PhSOjNa) 3 . 

10. The method of claim 1, wherein the primer is a self- 
priming moiety. 

11. The method of claim 1. wherein the DNA is bound to a 
solid substrate. 



12. The method of claim 11, wherein the DNA is bound to 
the solid substrate via 1,3-dipolar azide-alkyne 
cycloaddition chemistry. 
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13, The method of claim 11, wherein the DNA is bound to 
the solid substrate via a polyethylene glycol molecule 

14. The method of claim 13, wherein the DNA is alkyne- 
labeled. 



15. The method of claim 11, wherein the DNA is bound to 
the solid substrate via a polyethylene glycol molecule 
and the solid substrate is azide-functionalized. 



16 



The method of claim 11. wherein the DNA is immobilized 
on the solid substrate via an azido linkage, an 
alkynyl linkage, or biotin-streptavidin interaction. 

17. The method of claim 11. wherein the solid substrate is 
in the form of a chip, a bead, a well, a capillary 
tube, a slide, a wafer, a filter, a fiber, a porous 
media, or a column. 

18. The method of claim 11, wherein the solid substrate is 
gold, quartz, silica, plastic, glass, diamond, silver, 
metal, or polypropylene. 

19. The method of claim 11. wherein the solid substrate is 
porous . 

20. The method of claim ll. wherein about 1000 or fewer 
copies of the DNA are bound to the solid substrate. 

21. The method of claim i. wherein the four nucleotide 
analogues in step (a) are 3 ' -O-allyl-dGTP-allyl-CyS. 
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3' -O-allyl-dCTP-allyl-Bodipy-FL-SlO. 3 ' -O-allyl-dATP- 
allyl -ROX and 3? -O-allyl-dirrP-allyl-ReG. 

22. The method of claim i. wherein the four nucleotide 
analogues in step (a) are 3' -O-allyl-dGTP-allyl- 
Bodipy-FL-SlO. 3'-O-allyl-dCTP-allyl-Bodipy-650, 3'-0- 
allyl-dATP-allyl-ROX and 3' -0-allyl-dUTP-allyl-R6G. 

2.3. The method of claim l, wherein the four nucleotide 
analogues in step (a) are 3' -O-allyl-dOTP-allyl- 
Bodipy-650. 3'-O.allyl-dCTP-allyl.Bodipy.FL-510, 3'-0- 
allyl-dATP-allyl-Rox and 3 ' -0-allyl-dUTP-allyl-R6G. 

24. The method of claim l, wherein the reversible 
terminators in step (c) are 3 ' -o-allyl-dGTP, 3'-o- 
allyl-dCTP, 3' -O-allyl-dATP and 3 ' -O-allyl -dUTP. 



25 



The method of claim l, wherein the reversible 

terminators in step (c) are 3' -o-2-nitrobenzyl-dGTP 

3' -O-2-nitrobenzyl-dCTP, 3 ' -0-2-nitroben2yl-dATP and 
3' -O-2-nitrobenzyl-dUTP. 



26. The method of claim i, wherein the DNA polymerase is a 
9'^ polymerase or a variant thereof. 

27. The method of claim 1, wherein the DNA is bound to 
the solid substrate via a polyethylene glycol molecule 
and the solid substrate is azide- functional zed or the 
DNA is immobilized on the solid substrate via an azido 
ixnkage, an alkynyl linkage, or biotin-streptavidin 
interaction; 
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wherein (i) the four nucleotide analogues in step (a) 
are 3' -0-allyl-dGTP-allyl-Cy5, 3' -O-allyl-dCTP-allyl- 
Bodipy-FL-510, 3 ' -0-allyl-dATP- allyl-ROX and 3'-0- 
allyl-dUTP-allyl-R6G, (ii) the four nucleotide 
analogues in step (a) are 3 ' -O-allyl-dGTP-allyl- 
Bodipy-FL-510, 3' -O-allyl-dCTP-allyl-Bodipy-650, 3' -O- 
allyl-dATP- allyl-ROX and 3' -O-allyl-dUTP-allyl-RSG, or 
(iii) the four nucleotide analogues in step (a) are 
3' -O-allyl.dGTP-allyl.Bodipy-650, 3 ' -O-allyl-dCTP- 

allyl-Bodipy-FL-510, 3' -O-allyl-dATP-allyl-ROX and 3'- 
0-allyl-dUTP-allyl-R6G; and 

wherein the reversible terminators in step (c) are 3'- 
O-allyl-dGTP, 3 ' -O-allyl-dCTP, 3 ' -O- allyl-dATP and 3'- 
O-allyl-dUTP. 

28. A kit for performing the method of claim 1, comprising, 
in separate compartments, 

(a) nucleotide analogues of (i) GTP, (ii) ATP, (iii) 
CTP and (iv) TTP or UTP, wherein each analogue 
comprises (i) a base selected from the group 
consisting of adenine, guanine, cytosine, thymine 
or uracil, or an analogue thereof, (ii) a 
deoxyribose, (iii) a cleavable moiety bound to 
the 3* -oxygen of the deoxyribose and (iv) a 
unique label bound to the base via a cleavable 
linker, 

(b) reversible terminators comprising a nucleotide 
analogue of (i) GTP, (ii) ATP, (iii) CTP and (iv) 
TTP or UTP, wherein each analogue comprises (i) a 
base selected from the group consisting of 
adenine, guanine, cytosine, thymine or uracil, or 
an analogue thereof, which base does not have a 
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unique label bound thereto, (ii) a deoxyribose, 
and (iii) a cleavable moiety bound to the 3'- 
oxygen of the deoxyribose; 

reagents suitable for use in DNA polymerization; 
and 

(d) instructions for use. 

29. The kit of claim 28, wherein the nucleotide analogues 
of part (a) are 3- -O-allyl-dGTP-aHyl-CyS, 3'-0-allyi- 
dCTP.allyl-Bodipy-FL-510, 3' -O-allyl-dATP- allyl-ROX 
and 3' -0-allyl.dUTP-allyl-R6G. 

30. The kit of Claim 28, wherein the nucleotide analogues 
of part (a) are 3' -O-allyl-dGTP-allyl-Bodipy-FL-SlO 
3' -O-allyl-dCTP-allyl-Bodipy-650. 3' -O-allyl-dATP- 

allyl-Rox and 3 ' -0-allyl-dUTP-allyl-R6G. 

31. The kit of claim 28, wherein the nucleotide analogues 
of part (a) are 3 ' -O-allyl-dGTP-allyl-Bodipy-650 3' - 
O-aHyl-dCTP-allyl-Bodipy-FL-510, 3 ' -O-allyl-dATP- 

allyl-ROX and 3 ' -0-allyl-dUTP-allyl-R6G. 

32. The kit of claim 28, wherein the nucleotide analogues 
of part (b) are 3' -o-allyl-dGTP, 3 ' -o-allyl-dCTP, 3'. 
O-allyl-dATP and 3 ' -0-allyl-dUTP. 

33. The method of claim 28, wherein the reversible 
terminators in step (c) are 3 ' -o-2-nitrobenzyl-dGTP, 
3' -O-2-nitrobenzyl-dCTP, 3' -0-2-nitrobenzyl-dATP and 
3' -0-2-'nitrobenzyl-dUTP. 
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Figure 1 
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Figure 2 
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Figure 3 




wo 2008/069973 



PCT/US2007/024646 



4/12 



Figure 4 
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Figure 5 
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Figure 6 
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Figure 8 
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Figure 9 
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Figure 10 



^ CO ir> 
O 

I , >s 



I I 

ur> CO 




Al|MI«|U| 



0) 
Q. 

E 



CD 
0> 
Q. 
g> 

CO 
O 



I 

a. 



cu 

6 

CO 



Q 



CO 



a. osT 

I 



CO 
(0 

O UJ 

E 



e 



<? in 



CD 
P 

O 

t 

O 

o 
p 

o 
o 
o 
< 



< 



< 



E Jo 

a 



«0 

3 i 



CO in 
• ■ 



o p 
< K S 



tn CO 



C 
(0 




wo 2008/069973 



12/12 



PCT/US2007/024646 



Figure 11 



q!co Co 

1 ' * 

°P >. CD 

o o 



I I 

CD CO 




Q. 

E 

(D 

o 

<3 

0) 

Q. 

g> 

(0 
O 

o 



Q. 



7 
O 

I 

CO 



o 

'co 
c 

X 
LU 




CO in 



3 

O 
< 

< 



m CO 



O 
O 



<3 «» 



0) 

e 



0} 

E 



CO lO 



o 
o 



CM 



I 



I 

CO 



CQ 



Q. 



Q> 

o 




04 




CO 



E 

Q. 



I 



II 

o 



